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Complex systems consist of many interacting elements which participate in some dynamical pro- 
cess. The activity of various elements is often different and the fluctuation in the activity of an 
element grows monotonically with the average activity. This relationship is often of the form 
"fluctuations ~ const, x average"", where the exponent a is predominantly in the range [1/2,1]. 
This power law has been observed in a very wide range of disciplines, ranging from population dy- 
namics through the Internet to the stock market and it is often treated under the names Taylor's 
law or fluctuation scaling. This review attempts to show how general the above scaling relationship 
is by surveying the literature, as well as by reporting some new empirical data and model calcu- 
lations. We also show some basic principles that can underlie the generality of the phenomenon. 
This is followed by a mean-field framework based on sums of random variables. In this context 
the emergence of fluctuation scaling is equivalent to some corresponding limit theorems. In certain 
physical systems fluctuation scaling can be related to finite size scaling. 
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1. INTRODUCTION 

Interacting systems of many units with emergent col- 
lective behavior are often termed "complex". Such com- 
plex systems are ubiquitous in many fields of research 
ranging from engineering sciences through physics and 
biology to sociology. An advantage of the related multi- 
disciplinary approach is that the universal appearance of 
several phenomena can be revealed more easily. Such 
generally observed characteristics include (multi-)frac- 
tality or scale invariance 0, , the related Pareto or Zipf 
laws @, 3j self-organized and critical behavior. 

In this paper we study such a general feature related to 
the scaling properties of the fluctuations in complex sys- 
tems. This type of scaling relationship is called Taylor's 
law by ecologists after L.R. Taylor and his influential pa- 
per on natural populations in 1961 The law states 
that for any fixed species the fluctuations in the size of a 
population (characterized by the standard deviation) can 
be approximately written as a constant times the average 
population to a power a: 

fluctuations « const, x average" 

for a wide range of the average. 

The phenomenon was - to our knowledge - first dis- 
covered in 1938 by H. Fairfield Smith [f|, who wrote an 
equivalent formula for the yield of crop fields though his 
paper has, surprisingly, received much less attention than 
Taylor's work. The same relationship was explored re- 
cently by Menezes and Barabasi Q for dynamics on com- 
plex networks, and later termed "fluctuation scaling" § 
in the physics literature. There the temporal fluctuations 
and the averages of the network's traffic were measured 
at the different nodes. 

Despite the analogous questions, the exchange of ideas 
between disciplines is very limited. This review attempts 
to show how general fluctuation scaling is by surveying 
the literature and the current models, as well as by re- 
porting some new empirical evidence and presenting new 
model calculations. We also have the aim to step be- 
yond mere demonstration, and show some basic princi- 
ples that can potentially underlie the generality of the 
phenomenon. 

The paper is organized as follows. Section [5] gives a 
more precise definition of fluctuation scaling. We then 
give a brief overview of empirical results from the liter- 
ature, and also some previously unpublished findings. 1 
Section [3] presents a general mean- field formalism based 



1 Please note that "log" will always denote 10-base logarithms. 



on sums of random variables. This is followed by the in- 
terpretation of the scaling exponent a, and how it reflects 
the dynamics of the complex system. We show that a is 
usually between 1/2 and 1, and that these two limiting 
values can both arise from several, simple types of dy- 
namics. We then show three scenarios how intermediate 
exponents can arise. Because of this multitude of pos- 
sibilities no value of a can be used in itself to uniquely 
identify the internal dynamics of a system, but it is still 
possible to exclude many options that would be incom- 
patible with the observed value of a. The remaining pos- 
sibilities can be narrowed down further by analyzing the 
time scale dependence of a and by the application of our 
mean field framework. The procedure is demonstrated 
on some simple models in Section 21 and the relationship 
between fluctuation scaling, (self-organized) criticality, 
scaling and multiscaling is explored. Fluctuation scal- 
ing can be directly applied to certain physical systems, 
where one finds a strong connection with finite size scal- 
ing. Section [5] gives a general discussion and directions 
for future research. Finally Section [5] concludes. Some 
calculations were deferred to the Appendices. 



2. FLUCTUATION SCALING 
2.1. Basic notions 

Throughout the paper we will always consider some ad- 
ditive quantity /, and the dependence between its mean 
and standard deviation. By dependence we mean the be- 
havior of / over a multitude of observations. Say, if we 
can observe the same dynamical variable in several set- 
tings where it has different means, how does the standard 
deviation change with the value of the mean? 

In order to determine this dependence one needs many 
realizations. These can be simultaneous temporal obser- 
vations for different elements (nodes, subsystems) of a 
large complex system. The measured means and stan- 
dard deviations are then calculated in time, and the sub- 
systems are compared: for subsystems with a larger mean 
/ are the fluctuations larger as well? 

In other cases / is not considered as time dependent, 
only as a fixed value for every subsystem. Then the av- 
erages are taken over an ensemble of subsystems of equal 
size, and the standard deviation characterizes the varia- 
tion of / between subsystems of the same size. 

We just used the expressions "elements", "nodes", "sub- 
systems" and "the same size", but what do these mean? 
Imagine that we want to quantify fluctuations in the traf- 
fic of Internet routers. It is very straightforward to cal- 
culate the mean and the standard deviation of, say, daily 
data throughput, and the question whether routers with 
larger mean traffic exhibit larger fluctuations can be in- 
vestigated. However, routers are not "subsystems" of the 
Internet. They merely represent points of measurement, 
elements of the system. The traffic is formed as a sum 
of data packets that are "extrinsic" to the elements. The 
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packets do not belong to the structure of the network, 
but they carry the dynamics on it. Here we are not in- 
terested in the structure of the routers, i.e., the nodes. 
Instead, the data over which the averages are taken have 
a temporal structure. 

Let us take a different example. Now we want to ana- 
lyze data on the size of animal populations. A population 
can be divided into smaller groups, which then consist of 
individuals, and this gives a true notion of size. Various 
smaller areas can be naively considered as "subsystems" 
with respect to the habitat of the species, for example a 
continent. These subsystems are not structureless, and 
their population comes about as a sum over their smaller 
subgroups of individuals. 

In our review we will call the points of measurement 
as nodes, whether they have a structure or not. The ad- 
ditive quantity under study, be it activity, population, 
traffic or whatever else, will be denoted by fi, where i 
indicates the node of measurement. This will always be 
decomposed as a sum of random variables, which will ei- 
ther represent internal constituents or some events simi- 
lar to the arrival of the extrinsic "packets" to the Internet 
routers. In both cases we will call these the constituents 
of the nodes/signals. Their number for node i will be de- 
noted by Ni, and their respective contributions to / will 
be denoted by Vi >n , where n= 1 . . . Ni. Examples of the 
scheme for building up a system from constituents can be 
seen in Table HI Now we turn to more precise definitions. 

2.1.1. Temporal fluctuation scaling (TFS) 

Let us assume that during an extended period we can 
measure an additive quantity fa within a system at its 
nodes (labeled by the index i). For some finite time du- 
ration [t, t + At) the signal can be formally decomposed 
as the sum 

/*'(*) = E O)- (!) 

71=1 

iV At (i) is the number of constituents of node i during 
[t,t + At). We assume that Vi%(t) > 0, so that the 
time average of /, A * does not vanish. For example, if 
on the stock market during [t,t + At) there are N^(t) 
transactions with the papers of the i'th company, and the 
n'th of those transactions has a value Vj„ (£), then the 
total trading activity of stock i can be calculated by this 
formula. 

The time average of Eq. ([T]) , which we will denote as 
(/ A *), can be calculated as 

1 Q-l 1 Q-lN^(qAt) 

^ q=0 ^ q=0 n=l 

(2) 

where Q = T/At, and T is the total time of measure- 
ment. From the definitions it is trivial that (/ At ) = 



At (/ At=1 ). We will use (fi) without the upper index to 
denote this latter quantity. 

On any time scale the variance can be obtained as a 
time average: 

a?(At) = <[/^f>-</^> a , 

this quantity characterizes the fluctuations of the activity 
of a fixed node i from interval to interval. 

When / is positive and additive, it is often observed 
that the relationship between the standard deviation and 
the mean of / is given by a power law: 

a,(At) oc (/ A <p , 

where one varies the node i, and At is fixed. The de- 
pendence of the right hand side on At is trivial, since 
(/ At ) = At (fi). Thus throughout the paper we will use 
(fi) as the scaling variable: 

a t (At) ex (/ip . (3) 

The exponent ax is usually in the range [1/2,1]. The 
lower index T in the scaling exponent indicates that the 
statistical quantities arc defined as temporal averages as 
inEq. @. 

Finally, if the i-dependence of a and (/) is only mani- 
fested via a well defined parameter of the nodes, such as 
their linear extent (L), area (A), a fixed number of con- 
stituents (N) or some other size-like parameter 5*, then 
we will use this quantity as lower index where possible. 
For example temporal standard deviation will be denoted 
as as- 

2.1.2. Ensemble fluctuation scaling (EFS) 

Again imagine that nodes have a well defined size-like 
parameter S, and it is possible to group them accord- 
ing to that. Furthermore, assume that nodes that fall 
into the same group have equivalent statistical proper- 
ties. Then aside from the temporal average given sepa- 
rately for each node, one can also define the average of 
/ within each group. This is a sort of ensemble average 
over similar nodes, it will be denoted by / At , and it can 
be calculated as 

W=^- E //*(*) ( 4 ) 

Vj:S 1= S 

Both t and At are now fixed, the summation instead goes 
for those nodes i which have a size Si = S, and Ms is 
the number of such nodes. In the notation we will omit 
t for simplicity. Variance is given by 

^ s {At)=Wf-J^ 2 . 
Fluctuation scaling can also arise here in the form 

^(At)«7T E , (5) 
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constituents — > nodes/elements — > complex system 
Vi,n, n = 1, . . . , Ni — * fi, i = 1, . . . , M — * total activity 
a group of individuals — > a population — > a species 
a single tree —> a single forest — > all forests of a continent 
a single data packet — > router — > Internet 
a single car — > measurement point — > highway system 

Table I: Examples of building up a system from its constituents. The symbol — > indicates that the kind of object on the right 
is made up by several ones of the kind on the left. 



where we compare different groups by varying S, while 
At is kept constant. For convenience we follow the con- 
vention of the previous part: On the right hand side of 
Eq. (JSJ) we write fs, which is a short notation for fg' t=1 - 
The scaling exponent «e will always indicate when we 
use ensemble averaging over elements of the same size. 

For data analysis the size S very often corresponds to 
the linear size L, or the area A of the node/subsystem, 
and there the lower index will be changed accordingly. 
For example, the classic study of Taylor Q compares 
areas of different size A, and the measured quantity is 
the population size of a given species in the area. The 
constituents can be smaller groups or, as usually called, 
metapopulations. If one considers the number of groups 
(N) and the size of the groups (V n ) as random variables, 
the total population has the same sum form as before: 

Ni 

fi ^ . Vi,n i 

71 = 1 

which is the analogue of Eq. ([T]) . 

We will call the relationship ([3]) temporal fluctua- 
tion scaling (TFS), and ([5|) ensemble fluctuation scaling 
(EFS). When we do not wish to distinguish between the 
two cases, we will simply use fluctuation scaling (FS), and 
then the exponent will be denoted by a without lower 
index. There exists a large body of results on these sub- 
jects, and the literature is spread over many disciplines. 
Therefore in the following we would like to give a (nec- 
essarily incomplete) overview of the results. A summary 
is presented in Table [TTJ 

2.2. Empirical results: ensemble averages 

2.2.1. Pioneering studies 

As noted in the introduction, the first observations of 
fluctuation scaling appeared in two independent studies, 
well before the widespread recognition of fractality and 
scaling @ . The paper of Fairfield Smith [|| was published 
in 1938, and it was concerned with the yields of agricul- 
tural crops. For a fixed size of land (A) it is possible to 
calculate the average yield f a of a certain type of crop, 
and the standard deviation ~5~a of the yield between areas 



of size A. Then the calculation can be done for areas of 
many different sizes. It was found that there exists the 
power law ([5]) relationship between the two quantities, 
Wa oc /a E , with cue ~ 0.62. 

Taylor's 1961 paper [j| stated the scaling law ((5J for 
systems in population dynamics. Similarly to Fairfield 
Smith, Taylor took an ensemble of areas of the same size, 
and measured the number of individuals of a certain type 
of animal. With increasing area size both the mean and 
the variance of the population grew, with a power law 
relationship between the two quantities. Let us now take 
a closer look at fluctuations in ecology. 



2.2.2. Ecology 

Stable populations in a given habitat fluctuate around 
a typical size called the habitat's carrying capacity 
[ToL Ull HH ■ These fluctuations have a very rich inter- 
nal structure [l2j ■ Both the randomness of birth-death 
processes (a kind of "intrinsic noise") and external cli- 
matic forcing play an important role [l^, EH • The effect 
of climatic factors is so strong that it can synchronize the 
fluctuations of even non-interacting populations (the so- 
called Moran effect) [U G3- To further complicate the 
situation, individuals of a species interact among them- 
selves, just as well as species interact with each other. 
These interactions are non-linear and by now they are 
also commonly recognized to have a significant depen- 
dence on the population density/size. So interactions, 
driving and noise all contribute to population dynamics 
to a certain degree fl2l . [l3j]. This diversity makes any 
ecosystem a showcase of complexity; certain regularities 
are known, but the bigger picture is still missing. 

This is the reason why by discovering a universal law, 
Taylor's paper [f| triggered a growing activity in ecology, 
with literally a thousand publications to date. Taylor's 
results were verified for a wide range of populations, and 
the value of the exponent was predominantly found to 
be 1/2 < «e < 1 uM- Despite its generality, the origins 
of the law and the meaning of a are still much debated. 
Anderson et al. [lj| suggested that the influence of en- 
vironmental fluctuations may be responsible for the ob- 
served non-trivial exponents. The model of Kendal [l~7j 
proposed a dynamics similar to Diffusion Limited Ag- 
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Figure 1: Fluctuation scaling for ensemble averages of the 
population of four species. Every point represents the mean 
/a and variance a\ over an ensemble of areas of the same size 
A The bottom dashed line corresponds to oe = 1/2, the 
top one to ole — 1- Points were shifted both vertically and 
horizontally for better visibility. 



Figure 2: Fluctuation scaling for the cell count of species. 
For every species the average cell count / and its variance 
<r 2 was calculated separately. Then these points were binned 
logarithmically for better visibility, error bars show the stan- 
dard deviation of logo" in the bins, oie = 1. Data courtesy of 
Ricardo Azevedo 12011. 



gregation in which self-similarity give s rise to the mean- 
variance scaling. Another study [l8| proposed that the 
exponents can be described by a class of statistical mod- 
els, which rely on the interplay between the number of 
animal clusters in an area and the size of the individual 
clusters. We will discuss these models in detail in Sec- 
tion [2331 For two comprehensive reviews of these (and 
more) scaling laws in ecological and related systems see 
Kendal [U] and Marquet et al. [H]. 



2.2.3. Life sciences 

There is a number of findings from cellular and molec- 
ular biology regarding FS. Azevedo and Leroi [2(| con- 
ducted a very extensive study of how the typical cell 
count of a species is related to its fluctuations from indi- 
vidual to individual. They found that FS holds over al- 
most 10 orders of magnitude in size, between more than 
2000 species, see Fig. [2] The exponent aE differs among 
tissue types, but for entire organisms its value is approx- 
imately 1. Kendal [2l[ presents similar findings for the 
number of tumor cells in groups of mice, but the expo- 
nents vary. 

Similarly, Kendal [22| analyzed for the most common 
variations in the human gen ome called Single Nucleotide 
Polymorphisms (SNPs) [230 - He found that the mean 
and the variance of the number of SNPs in a DNA se- 
quence scale as different non-trivial powers of the length 
of the sequence, and thus the variance also scales with 
the mean. 



2.2.4. Physics 

FS has been present in the physics literature as well. 
Many extensive quantities are known to have equilibrium 
fluctuations proportional to the square root of the sys- 
tem size, implying the value aE = 1/2 0, This 
relationship is a simple consequence of the central limit 
theorem (see Section EHJ) . Botct et al. [H H3, S|] 
find EFS for a wide range of models, and also for the 
fragment multiplicity measured in heavy-ion collision ex- 
periments. Moreover, a linear (a = 1) relationship was 
found between the fluctuations and mean fluxes of cos- 
mic radiation [29|, [3(j ■ Here the ensemble is formed by 
cutting a single time series into pieces, and then periods 
with higher average activity exhibit higher fluctuations. 

2.3. Empirical results: temporal averages 

We will now turn to temporal FS. For the collection 
of such data it is necessary to have multi-channel mea- 
surements, simultaneously monitoring the behavior of a 
range of elements i. With the unbroken growth of com- 
puting infrastructure, many technological networks now 
offer appropriate datasets, several ones publicly available. 



2.3.1. Complex networks 

Menezes and Barabasi [3, HH , in part inspired by Tay- 
lor's original paper, found TFS for several complex net- 
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Figure 3: Fluctuation scaling for the traffic of Internet routers. 
For every router the temporal average of weekly data traffic 
(fi) and its standard deviation <Ji was calculated separately. 
Then these points were binned logarithmically for better vis- 
ibility, error bars show the standard deviation of log a in the 
bins. The fitted exponent is q:t = 0.75. Data courtesy of 
Jordi Duch and Alex Arenas [32l |. 



works. A good example is the analysis of Internet traf- 
fic, which was later revisited by Duch and Arenas [32| . 
In their study they analyzed the traffic of the Abilene 
backbone network. The nodes i correspond to routers, 
and the mean and variance of their data flow was calcu- 
lated. In Fig. [3] we show their results for weekly data 
traffic, the best fit is achieved with ax ~ 0.75. Mcnczcs 
and Barabasi also analyzed web page visitations, river 
flow, microchip logical gates and highway traffic. They 
proposed that the datasets should fall into two "univer- 
sality classes" with ax = 1/2 and 1. There also exists 
a growing body of literature on transport processes on 
networks, and the scaling of fluctuations in such systems 



2.3.2. Ecology 

Ecologists have made many advances regarding TFS as 
well, but the literature is far from unequivocal. The basic 
concept is to monitor many populations of a given species 
for an extended period of time, and then for each pop- 
ulation i calculate the temporal mean (fi) and standard 
deviation Ui of abundance. These are typically power law 
related according to TFS, examples are shown in Fig. 2] 

Classical population dynamics offers several bench- 
mark models [39|, 0, HH, but simple deterministic and 
Markovian models cannot explain the observed ax values 
between 1/2 and 1. After a range of small populations 
where they show realistic behavior, they cross over to ei- 



Figure 4: Fluctuation scaling for temporal averages of the 
population of three species. A point represents the temporal 
mean (fi) and standard deviation at of a population. The 
bottom dashed line corresponds to qt = 1/2, the top one 
to qt = 1. Points were shifted both verticall y fo r better 
visibility. Data courtesy of Marm Kilpatrick [37l. l38j. 



ther cyt = 1/2 or 1 j4^|. The model of Kilpatrick and Ives 
[37I ] suggested that the interaction between species and 
feedback mechanisms between their fluctuations can give 
rise to any value of qt. Perry proposed an even simpler 
chaotic model J43|. Both of these models can yield vari- 
ous exponents, but still only when populations are small 
enough. 

There have been several findings for plant species. In 
a series of papers Ballantyne and Kerkhoff showed that 
the reproductive (yearly seed count) variability of trees 
follows TFS with ax = 1- The same value is supported 
by the Satake-Iwasa [HI forest model. There the trees 
are modeled by interacting oscillators which synchronize 
above a critical value of the coupling [4{| |4(| . The syn- 
chronization transition coincides with a transition from 
cit = 1/2 to ax = 1. 2 

We will now briefly describe their empirical study [47[ • 
The dataset [48| consists of yearly observations of the 
seed production of trees throughout the Northern Hemi- 
sphere. In particular, we considered three subsets of the 
dataset [4!|, those collected by Tallqvist (50j . Franklin 
[5~fl | and Weaver and Forcella [|52|], including 4—17 years 
of observations for 44, 148 and 28 sites, respectively. The 
fits for TFS are given in Fig. [5{left). The exponents for 
the three subsets were found to be ax = 0.97, 0.93 and 
0.90. Given the quality of the fits it is not possible to 



2 Similar synchronization mechanism has also been observed in the 
reproduction of animals [ill . 
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outrule that for all three datasets ax = 1 (as suggested 
by Ref. [13 )• However, here we make an attempt to give 
an argument that predicts otherwise and can be tested. 

Simulations of the Satake-Iwasa model already sug- 
gested that long-range synchronization can cause ax > 
1/2, and the presence of such a tendency is well known for 
trees. Koenig and Knops [48| conclude that there exists 
a significant positive correlation between the reproduc- 
tive activity of trees for distances longer than 1000 kms 
(this phenomenon is called masting in the ecology litera- 
ture). While Ref. [48[ is much more precise and detailed, 
we also outline a simple measurement: in Fig. EJright) 
we plot the average C(r) cross-correlation coefficients be- 
tween the sites in the complete dataset as a function of 
the distance r of the sites. As expected, we find that 
cross-correlations decay very slowly with distance, and 
the dependence can be fitted approximately by 

C(r) ocr-°- 40 , (6) 

(although admittedly the fit is not perfect). Sections 
13.3.41 and 14.41 will show, that while perfect synchroniza- 
tion a'la Satake-Iwasa leads to ax = 1, partial synchro- 
nization with the above power-law correlations, implies 
or = 1 - 0.40/2 = 0.8, sec Eq. {20]). A better quan- 
titative agreement would warrant larger datasets which 
are not available at present, but there have been some 
promising attempts along the same lines [46| . 



2.3.3. Life sciences 

Keeling and Grenfell [H[ suggested TFS for the size of 
epidemics, and found both empirically and by a simple 
Markov chain model of population dynamics that vacci- 
nation in general decreases not only the size of epidemics 
but also the value of ax- TFS was later found by Wool- 
house et al. [HI to also hold between different pathogens. 
TFS has been found in the cell-to-cell variation of pro- 
tein transcription by Bar- Even et al. [HI, albeit with a 
crossover and a rather narrow range. 



2.3.4. Stock market 

In this section we summarize the results of a series 
of papers JR [H, US HH . The work was based on a TAQ 
database [59| , recording the transactions of the New York 
Stock Exchange (NYSE) for the years 2000 - 2002. Very 
similar results were obtained for the NASDAQ and Chi- 
nese markets [60j |. 

We define the activity of stock i as its total traded 
value, given as 

2V 4 A *(t) 

//"(*) = E 

n=l 



where N^ft) is the number of transactions of stock i in 
the period [t, t+At). The individual values of these trans- 
actions are denoted by Vf£(t). Data were detrended by 
the well-known [/-shaped daily pattern of traded volumes 

Then the measurement of mean and variance was car- 
ried out. The exponent ax shows a strong dependence on 
the window size At, we will return to this result in Sec- 
tion 13.3.11 The values range between ax = 0.68 — 0.87, 
see Fig. [B] 

When At is very small, Ref. @ shows that the indi- 
vidual transactions can be treated as independent events. 
Moreover, for the large enough stocks the average size of 
transactions ((V*)) can be calculated as a power of the 
mean number of transactions as 

(Vi) (x (TV/ , 

with (3 sa 0.65 §. Equivalents of this property recur 
in several FS-related contexts. We will devote Sections 
!3.3.2ll3~3~3"l to this observation, which we will call impact 
inhomogeneity Q. We will also show how to map the 
value of (3 onto non-trivial ax values. By that method 
[see Eq. (|18p] the corresponding ax value should be 0.70, 
which is very close to the actual value ax (At — » 0) = 
0.69. 

Another general observation Q is that if ax is a func- 
tion of At, then FS can only hold, if this dependence is 
logarithmic (cf. Section l3.3.ip . For the stock market this 
is true in two distinct regimes and those are separated by 
a crossover. For At < 10 3 sec (— sign) and At > 3 x 10 4 
sec (+ sign) one finds 

a>T,±(At) = a* T ± + 7± log At, 

with 7_ ?s 0.00, and 7+ w 0.06. On the other hand, the 
Hurst exponent Hi can be defined as [l], [6l| 

MAt) = ([/^(t) - </, At (t))] 2 ) 1/2 cx At* (7) 
For NYSE this equation is found to be valid with 
H i , ± = H* ± + 1± log(fi). 

Lower indices indicate the same two regimes, and 7± have 
the same values as for ar [H • 

2.4. New empirical results 

In this section we present previously unpublished re- 
sults for fluctuation scaling. Note that temporal vari- 
ances were estimated by the par tition function of De- 
trended Fluctuation Analysis [61(. This was necessary 
in order to (at least partly) remove the nonstationarity 
from the datasets. All results, including the values of ax 
agree qualitatively with those obtained from a direct cal- 
culation variance without detrending, but the accuracy 
of the estimation is improved. 
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Figure 5: (left) Fluctuation scaling for the yearly seed count (reproductive activity) of trees from three studies. The fitted 
exponents are qt = 0.90, 0.93, 0.97. Points were logarithmically binned and log cr was averaged for better visibility, error bars 
show the standard deviation of logo - in the bins. The estimates are close to, but below 1. (right) The average cross-correlation 
coefficient between pairs of sites as a function of their distance. Pairs of sites were binned to have an equal number per bin, 
error bars show the standard deviation of the cross correlation coefficients in the bins. The red line is a fit with the power law 
C(r) oc r~ ' 40 , from the fit the error of the exponent is ±0.06. 



2.4-1- Stock market (ensemble averaging) 

Fluctuation scaling in stock market data has just been 
discussed, but those earlier results pertained temporal 
FS, whereas here we will present some new findings on 
ensemble FS in the same dataset. 

We again consider the (daily) trading activity of stocks. 
The size of companies is often measured by the total 
value of all their issued stocks, called the company's cap- 
italization (C). We take a fixed time period, the day 
03/01/2000 (the results are similar for other days). Then 
we group the stocks according their capitalization into 35 
logarithmic bins. Finally, we calculate the mean fc and 
the standard deviation Wc of the activity in every group. 
EFS is shown in Fig. The fit gives a E « 0.8 - 0.9, 
although with some deviations from scaling. 

If the size of the animal population is extensive, it is 
justifiable to use the size of the area to parametrize the 
ensemble: its mean should be exactly proportional to 
the area. To use capitalization as a paramctrization for 
company size is a different matter. While it is indeed 
strongly related to the mean trading activity, there is 
no one-to-one correspondence between the two [57| . This 
means that companies of the same capitalization can have 
different expected trading activities. Thus our ensemble 
averaging technique is only approximate. 

To circumvent this problem one can apply the follow- 
ing trick. The trading activity of companies fluctuates 
strongly from day to day, but the expectation value of 
the distribution is rather stable over time. So let us now 
take an interval t = 1 . . . T and calculate the time aver- 
ages (fi(t)) during this period for each stock. Then for 
every stock take the single value fi(t = 1) only, and group 



the observations according to (/j). This is equivalent to 
the measurement before, only instead of Ci, groups are 
formed with respect to (/J (15 logarithmic bins). Then 
the ensemble mean / and variance a can be calculated 
in each group. The results from this technique are also 
indicated in Fig. \7\ one can see a dramatic decrease in 
the noise level, while the value of the exponent is approx- 
imately preserved, qe = 0.89. 



It must be emphasized that we use information about 
the temporal average only for the grouping procedure. 
The measured «e is a true EFS exponent. Moreover, the 
value of ctE that we find here is much larger than cvt (cf. 
Section f2.3.4[) . Because of the intricate statistical prop- 
erties of markets, the two exponents cannot be expected 
to coincide. 3 



This example shows that the success of ensemble fluc- 
tuation scaling crucially depends on the proper choice of 
the size parameter. In the cases where possible, physical 
size or area are good choices, because they are known to 
be extensive. Otherwise the above trick can be applied, 
but only if multiple observations are available for every 
node and the system is close to stationary. 



The same two averaging techniques (fixed time and an ensemble 
of stocks versus a fixed stock and multiple times of observation) 
were previously introduced for stock market price changes in Ref. 
& 
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Figure 6: The dependence of the FS exponent in stock market 
data on the size of the time window At. The dependence is 
logarithmic in two regimes, with the coefficients 7_ w 0.00 
for At < fO 3 sec and 7+ ss 0.06 for At > 3 x f0 4 sec. The 
crossover regime corresponds to the time scale of one trading 
day. Inset: FS for the fluctuations of the traded value of stocks 
for the window sizes At = 10 sec and At = 1 month. Points 
were logarithmically binned and log a was averaged for better 
visibility, error bars show the standard deviation of log a in 
the bins. 
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Figure 7: Fluctuation scaling considered as an ensemble av- 
erage over NYSE stocks for the trading activity. Calcula- 
tions were done both by considering the capitalization d 
(03/01/2000: ■), and the monthly average trading activity 
(/«) (03/01/2000: 0,01/02/2000: A, 01/03/2000: v) as sur- 
rogates for size. One can see that using {/;) for the formation 
of groups largely reduces fluctuations, but the exponent re- 
mains similar. 



Subj. 


System 


T/E 


Refs. 


r orks 


Random walk 


T 


[7,31, 33] 


Network models 


T 


[34, 35] 


t> 
+^ 


Highway network 


T 


[7,31J 




World Wide Web 


T 






Internet 


T 


[1,31, 32] 




Heavy ion collisions 


E 


[26, 27, 28] 


Pn 


Cosmic rays 


E 


[29, 30] 




Stock market 


T 


[8, 56, 57, 60] 


O 



Stock market 


E 


this review 


~\ 
cj 


Business firm growth rates 


E 


[62, 63] 


O 
CO 


Email traffic 


T 


this review 




Printing activity 


T 


this review 





River flow 


T 


[64, 65] 


Precipitation 


T 


[66] 


a 


Forest reproductive rates 


T 


[46,47] 




Satake-Iwasa forest model 


T 


[45] 


d 




Crop yield 


T 


m 


logy/p 


Animal populations 


T, E 


[5,10, 15, ^6J 


Diffusion Limited population 


E 


[12 




H 


Population growth 


T 


[67, 68] 


Exponential dispersion models 


E 


[18, 21, 69] 




Interacting population model 


T 


[3JJ 




Cell numbers 


E 


[20] 


03 


Protein expression 


T 


[55] 


<D 
CJ 

a 


Gene expression 


T 


[JO, UJ 


a 
'0 


Individual health 


E 


[72] 


, v 


Tumor cells 


E 


[21] 




Human genome 


E 


[22, 23] 




Blood flow 


E 


[69] 




Oncology 


E 


[21] 




Epidemiology 


T 


[53, 54] 



Table II: A list of some studies where fluctuation scal- 
ing/Taylor's law was directly applied or implied by a similar 
formalism. Groups were assigned by subject areas, Phy. = 
Physics, CI. = Climatology. The column T/E shows the type 
of fluctuation scaling, T: temporal, E: ensemble. 



2.4-2. Human dynamics 

The analysis of the records of human dynamics has re- 
cently seen growing interest [zD, [7f| [76| . Here we discuss 
two large technological databases of human activity: 

(i) Emails from the employees of the company Enron 
during the year 2000. We used a filtered variant of 
the original dataset posted by the Federal Energy 
Regulatory Commission [jjj]. We defined /j At (t) as 
the number of emails sent by the person i during 
the interval [t, t + At). 

(ii) Data on the printing activity of the largest printer at 
the Department of Computing at Imperial College 
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London (78J. The files include the complete year 
2003, we removed weekends, official holidays and 
closure times of the computer laboratory (23:00- 
7:00). We included 987 users who submitted at least 
3 documents during our analysis, except the single 
largest user who appeared to have different statisti- 
cal properties from the rest. fi(t) is defined as the 
number of documents submitted to print by user i 
in the time interval [t,t + At). Further details on 
the dataset can be found in Ref. [79| . 

Note that multiple copies of the same email/document 
sent/submitted simultaneously were counted as one. 

We are going to present these two datasets side by side, 
because they have strong similarities. Both show TFS for 
window sizes At = 5 . . . 2.8 x 10 4 sec. The exponent varies 
between a T = 0.52 - 0.72 (email) and a T = 0.57 - 0.83 
(printing). Fig. [5] shows the fits for time window sizes 
At = 10, 10000 sec. We find scaling over about 2.5 orders 
of magnitude, and the exponent depends on At, as shown 
in Fig. [HI Despite the level of the noise in the data, the 
dependence appears to be monotonically increasing, with 
two regimes separated by a crossover near At ~ 4000 sec 
(email) and At ~ 1000 sec (printing). The dependence 
is close to logarithmic, with the same form as for stock 
markets (the index — corresponds to the regime below, 
+ above the crossover): 

ar,± ( At) = atp ± + 7± log At, 

with 7™ lail ps 0.04, 7 ° mail PS 0.13, 7 print ps 0.09, and 
7^""* ss 0.02. Also for the Hurst exponents 

ffi;± = ffl+7±l0g (fi). 

The coefficients are 7 i mail ps 0.04, 7 ™ lail ps 0.11, -y* lint ps 
0.07, and 7^"' ss 0.01. The cr,(At) scaling plots are 
shown in Fig. [TUl and the Hurst exponents' dependence 
on (/) in Fig. fTU 

Finally, notice that for email data ax(At) tends to 1/2 
with decreasing window size, and the logarithmic ten- 
dency appears to saturate. On the other hand, for print- 
ing data the logarithmic tendency is markedly present 
even for short times. By an extrapolation from the trend 
one expects that «t(1 sec) ps 0.51. Section 13.2.11 of- 
fers an explanation why for very short times one expects 
qt = 1/2 in these datasets. 

2.4-3. Precipitation 

In this section we present a study (66| of the weekly pre- 
cipitation records of 22928 weather stations worldwide. 
The dataset was obtained from the Global Daily Clima- 
tology Network (GDCN) [80[. For one station, typically 
40 years of data is available between 1950 and 1990. TFS 
is found with ax = 0.77, see Fig. [T21 However, there are 
also some significant deviations. These can be interpreted 



based on geographical information: For every station, the 
geographical latitude (U, measured in degrees) and the 
height hi measured from sea level was known, and the 
multiple regression 

log cr, = C + a T log {fi) + Ci \k\ + Chh, + a (8) 

with an error term ei yields the results C = 0.896±0.002, 
a T = 0.732 ± 0.002, Q = -(8.79 ± 0.05) x 10~ 3 and 
Ch = (— 6±l)xl0~ 6 . All values are significantly different 
from zero at the 99.98% confidence level. For the single 
parameter fit of TFS, R 2 = 0.73, while for the multiple 
regression © one finds R 2 = 0.90 which is a substantial 
improvement due to the inclusion of gcogrcphical lati- 
tude 4 and, in smaller part, to the height above sea level. 
The remaining error term, although we did not find any 
appropriate explanatory variable, is still not unsystem- 
atic. By plotting e, on a map (see Fig. [13]) one finds a 
strong geographical clustering with q > typically, but 
not exclusively, in continental areas. This systematic ten- 
dency suggests that a well-defined origin might exist for 
such corrections. 

As for the value of qt, its origin will be analyzed in 
a later study (6r| . From preliminary studies it appears 
that it is not strongly dependent on the choice of the time 
scale At, and it is always significantly different from both 
1/2 and 1. 

2.5. Corrections to fluctuation scaling 

Similarly to precipitation data, there are significant 
corrections to TFS also in stock markets. These are re- 
lated to differences between market sectors. In Fig. [14] 
we plot Oij {fi) aT versus (fi) to characterize corrections 
to fluctuation scaling. By highlighting the alignment of 
three industrial sectors one can see that they form clus- 
ters, so the deviations are - to some degree - systematic. 

In most real systems fluctuation scaling is found rather 
as a general tendency than an exact law. The scaling 
plots have some broadening, which can have several ori- 
gins. A possible origin of poor fits can be the presence of 
crossovers and breaks in the scaling plots 0, 0, S3, H(| , 
although these can only be seen clearly in very few stud- 
ies [2(1 64]. In other cases the deviations are attributed 
to the quality of data and short sampling intervals [13] • 
Still, these corrections can be large and systematic. In 
Section 12.4.31 we showed that for precipitation fluctua- 
tions geographical location and height plays a role. Sim- 
ilarly, for the stock market the market sector matters. 
These effects could be uncovered, because of the availabil- 
ity of these independent quantities for each station/stock. 



4 Climatic fluctuations are well known to strongly affect ecological 
fluctuations [Til . 1 1 2f . It is an interesting fact that the variability 
of animal populations can also depend on latitudinal position 
(see Ref. |81| and refs. therein). 
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Figure 8: Fluctuation scaling for time resolutions of At — 10 sec and At = 10000 sec. Points were logarithmically binned and 
logo" was averaged for better visibility, the error bars represent the standard deviations inside the bins, (left) Results for the 
number of sent emails, (right) Results for number of printed documents. 




Figure 9: (left) The dependence of the FS exponent in the Enron email database on the size of the time window At. The 
dependence is logarithmic in two regimes, with the coefficients 7_ w 0.04 for At < 4000 sec and 7+ ~ 0.13 for At > 4000 sec. 
(right) The dependence of the FS exponent in printing data on the size of the time window At. The dependence is logarithmic 
in two regimes, with the coefficients 7- « 0.09 for At < 300 sec and 7+ w 0.02 for At > 1000 sec. 



In some models they can even be calculated analytically, 
see Section 14.1.2 1 



The fact that scaling is mostly very well preserved sug- 
gests that the investigated complex systems have a ro- 
bust dynamics characterized by a value of a, and that 
the role of the corrections is not substantial in the for- 
mation of fluctuations. There is wide consensus that the 
exponents are meaningful, and not significantly distorted 
by the non-scaling corrections. 



2.6. Summary of observations 

In sum, fluctuation scaling appears to be a surprisingly 
general concept that can be recognized in virtually any 
discipline where the proper data are available. The fluc- 
tuations of positive additive quantities appear to have 
the structure 

fluctuations = const, x average" x (1 + corrections). 

There is immense literature on the origins of fluctua 
tions in various systems, ranging from gene networks 
through complexity (31| to animal populations [ill . 



Il3l . [84J. The common point of all these works is that 
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Figure 10: Scaling plots of log a — 1/2 x log At versus log At generated by Detrended Fluctuation Analysis. Users were grouped 
by their average activity into three groups ({/} increasing from bottom to top, see plot for ranges) and the curves were averaged 
within groups. A horizontal line would correspond to complete the absence of correlations, and the slopes of the linear regimes 
are H — 0.5, where H are the typical Hurst exponents of groups, (left) Results for Enron email data. For shorter time windows 
At < 4000 sec, correlations are weak, and their strength increases slowly with greater (/). Then after a crossover regime, for 
At > 4000 sec correlations become stronger, with larger difference between the three groups, (right) Results for printing data. 
For shorter time windows At < 300 sec, positive correlations exist, and their strength increases with greater (/). Then after a 
crossover regime, for At > 1000 sec correlations become very weak for all three groups. 




Figure 11: The dependence of the Hurst exponent of / on (/). Points were logarithmically binned and H was averaged for better 
visibility, the error bars represent the standard deviations inside the bins, (left) Results for Enron email data. When At < 4000 
sec (■), depend weakly on (/), the coefficient of the logarithmic trend is 7_ = 0.04, correlations are weak, Hi-- « 0.5 — 0.55. 
For time windows At > 4000 sec (O), correlations become stronger, 7+ = 0.11, but there are strong fluctuations around the 
logarithmic trend, (right) Results for printing data. When At < 300 sec (■), the Hurst exponents depend logarithmically on 
(/) with a coefficient 7_ = 0.07. For time windows At > 10 3 sec (O), correlations are nearly absent, the respective Hurst 
exponents are Hi-+ ~ 0.5 — 0.55, and the logarithmic dependence is weak, with 7+ = 0.01. 



fluctuations originate from two factors: internal and ex- further analogies and to build a broader picture, 
ternal. Naturally, the dynamics, the structure, and the 
interaction of the nodes vary from case to case. We ex- 
pect that, e.g., the stock market trading activity and the 
reproduction of trees is very different. The discovery of 
FS as a common pattern can be a good start to point out 
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3.1. The components of fluctuations 




Throughout this section we will analyze the fluctua- 
tions of quantities of the form |T]), as measured by the 
standard deviation/ variance. Thus, it is important that 
there exists a simple analytical expression |58| for 

^ = ([fi(t)-{fi(t))} 2 )- 
Appendix [5] gives a proof that 

al = ^ Vi {N^)+^ m m\ (9) 

where (Vi) and Yj Vi are the mean and the variance of 
Vi, n . Similarly, (Ni) and E 2 ^ are the mean and variance 
of Ni. We also introduced the Hurst exponent Hy% of the 
constituents, which is defined as 



Figure 12: Fluctuation scaling for the weekly precipitation of 
weather stations. The fitted exponent is qt = 0.77. Points 
were logarithmically binned and log a was averaged for better 
visibility, the error bars represent the standard deviations in- 
side the bins. The inset shows the same plot and the same axis 
range, but without binning. One can see that there is a high 
number of outliers, due to other, {/}-independent corrections 
to FS. 
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for any t and At. If for a fixed i and t all Vi n (t) are 
uncorrelated, then Hy = 1/2 , while if they are long range 
correlated Hy > 1/2 [gUsH. 5 



3. A GENERAL FORMALISM 

In the following we will focus on the temporal variant 
of FS. In many cases the results continue to apply by 
simply dropping the time index t and averaging over an 
ensemble of systems. 

The previous section reviewed ample evidence that FS 
emerges in a very broad range of areas. Here we attempt 
to describe many of these by the same formalism. In the 
following we will assume that the systems are stationary. 
When considering a node i, its activity fi will always be 
decomposed as a sum. In some cases this means summa- 
tion over the node's internal constituents, as for forests 
where reproductive activity was the total of that for all 
trees. In other cases the nodes themselves are simple, 
and the signal is the sum of events at the nodes, like the 
passing of cars at counting stations. If in the time inter- 
val [t,t + At) there are Aj At (i) such constituents/events, 
and the activity of the n'th contributes V/^(t) to the 
total activity fi, then 

A t At (t) 

/**(*) = £ O)- © 



3.2. "Universality" classes 

In this formalism it is relatively easy to show that there 
exist two important classes of systems, one with a. — 1/2 
and one with a = 1. The existence of such classes was 
pointed out, for example, by Anderson et al. (l6j and 
later by Menezes and Barabasi 0]. 

These are not universality classes but rather simple 
limiting cases, and a is not a universal exponent in the 
usual sense of statistical physics J24|. Many empirical 
systems do not belong to either class, and both a = 1/2 
and a = 1 can arise from several types of dynamics. In 
order to make FS a truly useful tool in the analysis of 
empirical data, one needs a classification scheme for how 
different types of dynamics can be mapped onto a. Our 
current understanding of such classification is outlined in 
this section. 



3.2.1. The case a = 1/2 

We will now present two scenarios that can give rise to 
a = 1/2. The arguments will be given in the language of 
time averages, but they can be generalized to ensemble 
averages in a straightforward way. 



These V's do not necessarily have to be independent 
EH ! but we will assume that their (unconditional) dis- 
tribution does not depend on n. We will omit the index 
At, where appropriate. 



5 The Hurst exponent is only related to correlations in this simple 
fashion because the distribution of V; n (t) does not depend on n 



14 




£ 0.5 

_v 

g 5 0.0 
-0.5 



~\ Closed-end Investment Companies |~ 


' Electric services ' 


Finance companies 


- 




□ 

- 






□ 


n □ 


□ 

□ n 


□ 

□ 

□ □ 


□ 


□ □ 


□ 

□ □ □ 




H * fib 


□ □ 










™° * 




2 4 6 8 


2 4 6 8 


2 4 6 8 


log <f> 


log <f> 


log <f> 



Figure 14: log[<7i/ {fi) aT ] plotted versus log (fi) for the 10-second resolution traded value data of stocks. The data points were 
not binned or altered in any way, which makes visible the deviations from the original scaling law, which would correspond 
to a horizontal line. The lighter points indicate all stocks, while boxes (□) highlight the distribution of points for the three 
indicated economic sectors. The great degree of clustering both horizontally and vertically. Clustering along the log (/) axis 
only suggests that the sector has some typical trading activity. Systematic corrections to fluctuation scaling are indicated by 
clustering along the log a/ {f} aT axis. If a sector is clustered in the lower/higher half of the dataset, it means that its trading 
activity has typically lower/higher fluctuations than the market average. The presence of such sector dependent clustering 
suggests that the corrections to fluctuation scaling are not purely random. 



1) Let us assume that every node i consists of a fixed 
number Ni(t) = Ni of constituents, each with a 
signal Vi !n (t) which is i.i.d. for all i, n and t, with 
the same mean (V) and variance £y. From Eq. © 



it is trivial that here of 
linearity of the mean, (fi 



NiJ^y. Because of the 
= Ni (V) , so 



and hence a = 1/2. Of course in this simple case 
one can say more, because the central limit theorem 
[13 is applicable: 



T,nUVi,n(t)-Ni(V) 



Qi{t), 



ill 



where Gi(t) are i.i.d. standard Gaussians and "— >" 
means convergence in distribution for Ni — > oo. 
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Exactly the same equation can be rewritten to more 
resemble fluctuation scaling: 



Mt) - (fi) 
K(h) a 



Gi(t). 



(12) 



The power a = 1/2 is exactly the power in FS, and 

— 1/2 

K = Sy (V) . The conceptual difference is only 
that since we know that (fi) = Ni (V), we can use 
(fi) as a surrogate variable for JV». This is 

very useful, when we only have the time series of 
fi(t) available but not N, since the limit can be 
switched to (/) — > oo (cf. Section l5T2|) . 

1') Let us consider an example for scenario 1): a sys- 
tem, where V^ n (t) can only be 1 with probability p 
and with probability 1 — p. Scenario 1) still ap- 
plies because Vs are i.i.d., so a = 1/2. This binary 
distribution can be instructive, as one can think of 
Vi tTl (t) as independent indicator variables. For ex- 
ample let us take a volume S of ideal gas within 
a large container. Let the whole system contain 
N gas atoms, and Vi : „ = 1 if the ra'th atom is in 
the container, while Vi iTl = if it is not. The ideal 
gas is homogeneous and the atoms are independent, 
every atom having a probability p oc S of being in 
the small container. From here, one can apply the 
above argument to show that for various containers 
of different sizes 

os « (fs) 1/2 ■ 

These are the well-known square-root type fluctu- 
ations of equilibrium statistical physics (see, e.g., 
Section XII of Ref. [25|]). Similar arguments were 
suggested for the number of animals in an area: If 
the motion of individuals were independent (gas- 
like), then their spatial density fluctuations should 
follow a = 1/2 HI]. 

1") The example of the ideal gas can be given in the 
language of ensemble averages as well. Simply we 
take a large number of containers of the same size 
S and calculate the mean fs and standard devia- 
tion ~o~s of atom counts between these containers. 
Then we vary the container size, and we recover an 
analogous relationship: 

— -r 1/2 

Of course, this was expected, because the system 
is ergodic, so temporal and ensemble averages are 
equal. 

2) For an even simpler mechanism let us recall the 
findings of Section 12.4.21 We found that for very 
short times (At ~ 1 sec) the number of sent 
emails/printed documents follow TFS with = 
1/2. It is highly unlikely that someone will send 



several different emails/print several different doc- 
uments in the same second (duplicates of the same 
email to multiple recipients were excluded). Thus 
fi(t) = or 1, and so fi(t) = ff(t). Remember 
that this is very different from the previous exam- 
ple, where fi(t) was allowed to have any value, and 
only Vi tn (t)'s were constrained to or 1. 

In the email/print data the number of events per 
second was very low, generally (fi) < 4 x 10~ 3 
sec -1 . The standard deviation is then 



(fi) (fd 2 = (M (fi) 2 * (fi) 



(13) 



so a = 1/2. The same argument holds for the num- 
ber of trades per second in the stock market [8(. 

The meaning of this scenario 2) in summary: We 
are examining the system on such a short time scale 
that no two events happen in the same time win- 
dow. Then, the FS exponent tells us nothing about 
the dynamics of the system, because a = 1/2 is au- 
tomatically true. 



3.2.2. The case a = 1 

We will now present two scenarios that can give rise to 
the value a = 1. While 1) is only valid for TFS, 2) can 
be readily generalized for EFS as well. 

1) It was possible to obtain a = 1/2 by sums of in- 
dependent y's. In the other extreme case, if ev- 
ery node i had a fixed number of identical and com- 
pletely synchronized constituents, i.e., Ni(t) = Ni and 
Vi,„(t) = Vi(t) Eq. Q} simplifies to 



fi(t) = Y,Vi, n (t) = NiVi(t). 



Then <£(*)) = Ni (V t (t)), and <7< = N^ Vl . 



— If 
°i {Vi) {fi, 



(fi, 



(14) 



with a = 1 . The last proportionality only holds if the 
ratio Tivi/ (Vi) is the same for any i, for example when 
the distribution of Vi is independent of i. 6 

1') How is such an argument of any use? The study of 
Cho et al. [89[ reports experimental data on samples 
of yeast, in which cells were artifically prepared to 
have almost perfectly synchronized cell cycles. The 
measured signal fi(i) is the hourly expression level 
of various genes i in a sample. If all cells of yeast 



6 If the dependence is present but weak, then it may cause correc- 
tions to FS, but scaling should still hold approximately. 
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contribute in the same way to the measured expression 
level, and they are synchronized, then the value ax = 
1 is simply an indicator of such a synchrony. Thus FS 
probably tells us nothing about the dynamics of gene 
transcription, and the exponent is simply due to the 
sample preparation. 

Nacher et al. [7(| propose a stochastic differential 
equation model that predicts the same exponent ax = 
1 for this dataset (ax = 1 is confirmed by Zikovic et 
al. [7l|). They argue that self-affme temporal corre- 
lations are the origin of such a value. Section 13.3.11 
will show that self-affme temporal correlations do not 
contribute to ax in this way. Instead, our above ex- 
planation is simpler, and it suggests that the dataset 
cannot be used in favor of any proposed model based 
on the value of ax- 

1") Real systems are often not closed, but subject to out- 
side forces. In certain cases this driving can be so 
strong that it can overwhelm the internal dynamics. 
If the internal structure of the system becomes ir- 
relevant, this must also have an effect on FS. There 
have been a number of studies discussing how fluc- 
tuations in complex systems are formed as the sum 
of internall y ge nerated and externally imposed factors 
84 [.Anderson et al. [3 and Menezes 



luiiauau 

and Barabasi 0, [3l[ suggested that ax = 1 can arise 
when the external driving force imposes strong fluctu- 
ations in either Vi{t) or N t (t) (cf. Ref. [III). 

When all Vi, n (t) (the signals of every constituent at 
every node) become synchronized, then we are back 
at scenario 1): a = 1, because fi(t)/ (fi) = V(t)/ (V) 
which has a universal, z-independent distribution. 

It is also possible that an external force W(t) affects 
the number of constituents in the elements so strongly 
that the fluctuations of Ni(t) become proportional 
only to this force. In this case iVj(t) = AiW(t), where 
At are node-dependent constants. One expects that 
generally (f t (t)) = A, (W(t)) (V t ), whereas 



{Vi? 



tfn (N) 



Af^ w (Vi) + (W) 



If fluctuations in W are so large that (W) <C S^, then 
only the first term remains. After some algebraic steps 



Y 2 



mm 



(fif CC (fi) 



2rvi 



with ax = 1. The last proportionality is true if the 
distribution of Vi iU does not depend strongly on i. 

2) a = 1 can be a sign of a universal distribution of 
fi{t)l (fi), which only varies by a constant multiplica- 
tive factor throughout nodes. If this is true, then 
fi(t) can be decomposed into this factor Fi, and the 
universal random variable Vi(t), which are identically 
distributed for all i. Naturally (fi) = Fi(V), and 



FfTfy, and a t 



^v(vy l (h). 



3.3. Other values of a 

It has been observed that many real systems obey FS 
with a values that significantly differ from both 1/2 and 
1. In this section we summarize the current knowledge 
of general mechanisms that can give rise to intermediate 
values. 



3.3.1. The dependence of a. on the time resolution At 

First of all, a can depend on the size of the time win- 
dow used for its measurement. This phenomcnological 
picture can be used to understand the results of Section 
12.3.41 for stock market trading, and Section 12.4.21 for hu- 
man activity. 

Let us assume, that the activity time series are long 
time correlated with Hurst exponents Hi that are allowed 
to depend on the node i. The Hurst exponent of the time 
series /j A *(i) was previously defined as 



^(At) = ([f^(t)-{f^(t))] 2 ) 



1/2 



cx At 



Hi 



This definition is almost exactly the same as Eq. (flO|) 
for Hy, the only difference being that now instead of the 
N number of constituents we consider the time window 
size At as the scaling variable. TFS deals with how the 
variance scales when one moves to stronger (larger (/)) 
signals: 



0i oc (fif 



Eq. (|7|) takes an alternative point of view and suggests 
that for a fixed signal, in the presence of long-range tem- 
poral correlations, the variance can grow anomalously 
also by changing the time window. 

Following Ref. @|, from Eqs. ©) and (J5J), it is easy 
to see that the roles of (fi) and At are analogous. Since 
the left hand sides are the same, one can write a third 
proportionality between the right hand sides: 



At 



Hi 



(fi 



,a(At) 



After taking logarithm on both sides, and differentiating 
by d 2 /d(\ogAt)d(log( fi)), one finds that asymptotically 



da(At) 

d(log(/ i )) ~ d(logAt) 



7- 



(15) 



This means that both partial derivatives have the same 
constant value, which we will denote by 7. 

Eislcr and Kertesz 0] outline three scenarios for this 
equality to hold: 

(I) In systems, where 7 = 0, the exponent a, is inde- 
pendent of window size, and the degree of temporal 
correlations (H) is the same at all nodes. 
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(II) When 7 > 0, a(At) depends on At logarithmically: 
a(At) = a* +7! log At. The Hurst exponent of the 
node also depends on (/) logarithmically with the 
same prefactor: Hi = H* + 7 log (/.;). 

(Ill) It is possible that Eq. (fl5|) only holds piecewise, 
for certain ranges in At. Two regimes are then sep- 
arated by a crossover between two distinct values 
7±, and nodes will have separate Hurst exponents 
H-(i) and H + (i) in the two regimes. 

Case (III) was shown for the stock market and human 
dynamics in Section [2.31 



where we introduced the new parameter (3. 

As a quick check, the three toy models correspond to 
(3 = 0, a = 1/2 (all coins or 1); (3 = 1, a = 3/4 
(the coins value proportional to their number) and (3 — > 
00, a = 1 (only one coin with growing value). There is 
always some (3 > that allows us to reproduce a given 
value a £ [1/2,1), whereas the range j3 < covers all 
possibilities of a < 1/2 and a > 1. 



3.3.2. Impact inhomogeneity 

Any value of a can easily arise without dependence on 
the time window. To better understand the reason how, 
consider three toy systems with the following elements. 

(i) Let us take a fair coin with written on one side and 
1 on the other, this will be our group i = 1. Then 
take two such coins for group i = 2, three for i = 3, 
etc. In every time step we flip all coins in every 
group, and let fi equal the sum of the numbers we 
flipped in clement i. Naturally (fi) cx i and, if all 
coins arc independent, a. t cx i 1 / 2 . Thus, for such a 
case a — 1/2. 

(ii) Now let us take another fair coin with written on 
one side and 1 on the other, this will be our clement 
i = l. For t = 2, we again take only one coin with 
sides and 2. For any i, there will be a single coin 
with sides and i. Trivially (/,) cx i, but also dj cx i. 
So this time a = 1. 

(iii) In our final example, let us mix the above two. For 
the i'th group there are i coins, each having a side 
with and a side with i. Then (fi) cx i 2 , whereas 
Oi cx i 1 / 2 x i. We have just constructed a case for 
a = 3/4. 

One can unify these examples by introducing impact 
inhomogeneity. One can write the contribution (impact) 
of the constituents at a node i as 

Vi, n (t) = (Vi, n ) ■ X itn (t), (16) 

all X U} i(t) are i.i.d. with unit mean. We then allo w ( Vj n ) 
to depend on (Ni) as a power law between nodes f3ltl68T]: 

(Vi, n ) cx (Nif . (17) 
According to Eq. ([9]) fluctuations can be calculated as 

a 2 = Y? Vl (Nt) + T? Ni (Vi) 2 = 
Z 2 X m 2 (N) + (Ni) (^) 2 cx(/,) 2a , 

where T? x = (X 2 ) - (A) 2 , and 

^K 1 + FTt)' < 18 > 



3.3.3. Examples of impact inhomogeneity 

The ecology literature has documented [9(| [sH that 
empirically there is a strong positive correlation between 
the typical size of subpopulations 7 ((V)) and the number 
of subpopulations per unit area ((A)) or the total pop- 
ulation per unit area 8 ((/)). The conjecture that these 
quantities might behave as powers of each other as in Eq. 
(TIT)) was proposed by Keitt et al. [H| , both across species 
and for individual subpopulations of the same species. 

Kendal makes a similar suggestion, and shows that 
it generates non-trivial exponents in EFS for ecological 
populations [3 an d the heterogeneity of blood flow in 
organs [69| . In fact he does not point out the general 
mechanism, but instead refers to non-trivial EFS expo- 
nents as the property of a class of models, which entail 
impact inhomogeneity. Here we will omit most of the 
formalism; a proof that Kendal's approach has impact 
inhomogeneity can be found in Appendix [Bl 

Let us take the case of animal populations as the exam- 
ple. Kendal proposes that EFS holds with an exponent 
1/2 < a < 1 if the population of an area can be described 
by the so-called Tweedie exponential dispersion models 
[92 1 . These assume that (i) an area i contains a Poisson 
distributed number of animal clusters (Ni), (ii) the size of 
individual clusters (Vi. n ) is i.i.d. gamma distributed, (iii) 
and there is a power law relationship between the means 
of these two quantities. Of course, (iii) is the same as Eq. 
(fT?)) . along with all of its consequences. 

As for blood flow [6^| , it is measured by the entrapment 
of radioactive microspheres in capillaries. In a fixed mass 
of tissue, the number of entrapment sites N is assumed 
to be Poisson distributed, while the blood flow V of the 
sites is taken as gamma distributed, along the same lines 
and with the same conclusions as above. 

Finally, Section 12.3.41 suggested impact inhomogene- 
ity also as the origin of non-trivial FS exponents for the 
traded value on stock markets. 



7 This is often called local abundance. 

8 regional abundance 
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3.3.4- Constituent correlations 

There exists a further mechanism to produce any value 
1/2 < a < 1, without considering the scaling property 
of impacts. The total output of node i is given by the 
equation 

N, 
i=l 

We also fix N% as time independent. If we assume that 
the unconditional distribution of V's is independent from 
Ni, and also from n, then one can denote the expectation 
value (V) = (V i>n (t)). 

The central idea is the introduction of correlations be- 
tween constituents, i.e., variables with different n. Let 
us assume for simplicity that the elements are situated 
on a one-dimensional lattice, and their activity is long- 
range correlated in space, so that the correlation function 
decays as a power law, 

C(An) cx (V itn Vi, n+A n) - (Vi, n f cx An 2Hv - 2 . (19) 

Hy is the same Hurst exponent, as defined in Eq. (jTUJ) - 
Then, positively correlated patterns display Hy > 1/2, 
for uncorrelated (or short range correlated) patterns 
Hy = 1/2, and for anticorrelated (antipersistent) pat- 
terns Hy < 1/2. 9 

It follows from Eq. © that the fluctuation of the com- 
bined activity of all constituents is: 

a- = Y, v N l oc (ft) , 

where 

a = H v . (20) 

This idea was (to our knowledge) first presented by West 
[93| . and demonstrated on surrogate data sets, but it was 
not applied directly to any new problem. The role of 
spatial correlations in the formation of FS in the context 
of ecology was also suggested by Colman et al. (iU and 
Ballantyne and Kerkhoff [46| more recently. The idea is 
confirmed by simulations, see Section [4.41 



4. MODELS 

In this section, we will discuss some models that can be 
used to understand basic facts about fluctuation scaling, 
how it arises and what its limitations are. 
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Figure 15: Fluctuation scaling for the random walker model 
on the Barabasi- Albert network with parameters M = 20000, 
W = 100, s max = 100 and T = 10000. The fitted exponent 
is Q?T = 1/2 with a little deviation for very small (/). Points 
were logarithmically binned and logo" was averaged for bet- 
ter visibility, the error bars represent the standard deviations 
inside the bins. The inset shows the values of the effective 
exponents as one increases the fluctuations (E\y) in the num- 
ber of walkers. There is a clear crossover from «t = 1/2 to 
qt = 1. 



4.1. Random walks on complex networks 

4-1.1. The model 

It was proposed by Menezes and Barabasi Q that ran- 
dom walks can generate TFS in the following way. Let us 
take a scale free Barabasi- Albert network 10 of M nodes 
[95l | . We distribute W independent random walkers (to- 
kens) randomly to the nodes. Then, in every time step 
these jump from their current node to one of their neigh- 
bors randomly. The process is repeated for s = 1 . . . s max 
steps, then it is halted and the total number of visits to 
each node i is counted. This number defines fi{t = 1). 
Then the deposition and the walk is repeated, up to T 
times, giving the time series fi(t). We ran simulations 
with the parameters M = 20000, W = 100, s max = 100 
and T = 10000. 

One finds that TFS holds with an exponent ax = 1/2, 
see Fig. [151 This value is the same as what arises from 
sums of independent random variables, so the central 
limit theorem is a possible origin of FS for random walks. 
The next part presents an analytical calculation that con- 



10 The particular topology is irrelevant from the point of view of 
«T • The network only has to be connected and the nodes should 
We need to assume that Vj,« is stationary as a function of n. have a wide range of degrees. 
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firms this conjecture. 

4-1-2. Fluctuation scaling and corrections 

The model can be solved based on a master-equation 
approach [33l . [9(| ■ Here we will use elementary probabil- 
ity theory instead. The number of walkers on node i can 
be calculated from their distribution in the previous time 
step as: 

Ni(s + 1)= ]T *n(7 -*';*). (21) 

j£ld n=l 

where S n (J — ► i; s) is a variable that is 1 if in step s the 
n'th token was at node j and then it jumped to node i 
(happens with probability 1/kj to all neighbors of node 
i), and otherwise, fcj is the degree of node i, JCi is the 
set of neighbors of node i, and Nj(s = 0) corresponds to 
the initial condition. 

Calculations in Appendix [C] show that for such a model 

(fi) = W (Ni) = h =ph, (22) 

where p = s max W/ J2j kj- As the number of walkers 
is multiplied by the s m ax and divided by the total num- 
ber of edges, p can be understood as the mean number 
of walkers passing any edge during the s max time steps. 
Furthermore, 

2 

^ = E % + (*> • ( 23 ) 

The first term on the right hand side is a sum over fc, 
nodes, but every term is multiplied by l/fc|, thus they 
can be neglected to a first order. To a leading order 

°t = (fi) , 

thus we find FS with «t = 1/2. 

The term with the sum presents corrections to the scal- 
ing law. Eqs. (|2"2"|) and (|2"3"|) could be solved numerically, 
but to get a qualitative understanding of these correc- 
tions it is enough to make a self-consistent solution up 
to the first non-trivial order. This can be done by taking 
er| = (fj) = pkj, and substituting it back into the right 
hand side of (j2"3"|) , to find 




where (1/km) is the average inverse neighbor degree of 
node i and a = 1. Simulation results supporting this 
argument are shown in Fig. 1161 We find that this formula 
accounts for a large part of the corrections to FS, only 
the coefficient is different, a sa 3.6. 



The qualitative picture from the above three equations 
is the following. For simplicity let us consider p = 1, 
when the average number of tokens at a node equals 
its degree. Thus on average in every step every node 
transmits one token on each of its edges to its neighbors. 
Consequently every node receives typically one token on 
each edge, so again it will have tokens equal to its degree. 
These tokens arrive independently, thus the variance is 
proportional to their number, which implies ax = 1/2. 
The corrections in Eq. imply that nodes with rela- 
tively higher degree neighbors (smaller (l/k Ni )) exhibit 
lower fluctuations 11 . This is because the number of to- 
kens at a neighboring site with smaller degree is smaller, 
and thus can have larger relative fluctuations. These fluc- 
tuations then affect our site via a stronger variation in 
incoming tokens. 

This argument is important, because it tells us that for 
random walks fluctuation scaling is only approximately 
true. The local topology of the network can give signif- 
icant corrections which cause a broadening in the scal- 
ing plots and which are not simply due to measurement 
noise. According to Eq. (|24[) the size of the correction 
term depends on the neighborhood of the node. Because 
(fi) cx fcj, very large flux nodes also have many neigh- 
bors. In an uncorrelated network the term (1/fcjv,) will 
converge to a constant value with growing fcj, its node 
dependence (and thus the broadening it causes) is di- 
minished. 



4-1.3. The role of node-node interactions and a connection 
with surface growth 

Previously we have shown that (fi) = phi, where p is 
the average number of tokens passing any edge during a 
time step. The fluctuations of the number of visits to 
node i come from two sources: (i) the number of such 
initial tokens at the neighbors, (ii) and how many of the 
tokens at its neighbors continue their walk to node i in 
the next step. The number of tokens at a node is cou- 
pled with the state of its neighbors in the previous step. 
This effective interaction between neighboring nodes is 
the origin of the corrections to FS in Eq. (|2"4"|) . To prove 
this, Menezes and Barabasi suggest a mean-field model 
0] which eliminates this interaction as follows. 

Instead of a direct contact between nodes, let us com- 
pletely disconnect the network, and connect every node 
with its original number of edges to a reservoir. In every 
step (s = 1 . . . S as in the original model) the reservoir 
sends W tokens, their destination is chosen randomly be- 
tween the edges. These tokens return to the reservoir in 
the next step, but simultaneously W new tokens are sent 
out, etc. It is trivial that in this case fluctuations of the 



1 The degree dependence of this correction is related to the assor- 
tativity of the network |97|| . 
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Figure 16: (left) Fluctuation scaling for the random walker model with parameters M = 20000, W = 100, s ma x = 100 and 
T — 10000. The same as Fig. 1151 only without the binning procedure. The scaling law with ax = 1/2 holds on average, but 
there is some systematic broadening. The inset shows 10 randomly selected points from the indicated area, with the average 
inverse neighbor degree (1/fcjvi) indicated for each node. There is a general increasing tendency in (1/fcjvi) from bottom to top. 
(right) The value of a/ </) 1/2 plotted versus the average inverse neighbor degree (1/fcjvi) of the node. There is an approximately 
linear relationship of the form a 2 / (/) ~ 1 + 3.6 (1/km)- 



type (ii) are absent: all nodes are neighbors of the reser- 
voir only, which emits the exact same number of tokens 
every time. Moreover, the distribution of fi(t) will be 
Poissonian with mean and variance pki. Thus exactly 



= (.A> 1/2 = 



(25) 



i.e., qt = 1/2 without any corrections. Moreover, both 
the network topology and the "randomness" of the walk 
was completely eliminated. As suggested by Menezes and 
Barabasi Q , the remaining model is equivalent to a sur- 
face growth problem. Consider a finite one-dimensional 
lattice with J^i ki sites. At every time step W tokens 
are deposited on the surface randomly. The Hurst expo- 
nent of the resulting surface is equivalent to the of 
the non- interacting model [cf. Eq. I|20p], 

This example suggests that FS in the random walker 
model is a mean-field property. The interaction between 
the nodes is only responsible for higher order corrections 
that do not change the scaling exponent in general. Most 
models in the literature are either non-interacting in this 
sense 0,|45|, HH or this interaction is not relevant @,[IH- 
At most, complex dynamics is limited to the structure 
within [37| the nodes, but not between the nodes. There 
exist a few studies of transport models on complex net- 
works where the interaction between the nodes becomes 
relevant. In these models fluctuation scaling breaks down 
and topology-dependent crossovers appear due to conges- 
tion [35| or the presence of multiplicative noise [34[ ■ 



4-1.4- The role of external driving 

Finally, let us briefly remark on the behavior of the 
model in the presence of external driving. One can allow 
the number W of walkers to fluctuate between the times 
t as 

W(t) = (W) + S W x G(t). 

We chose G(t) as i.i.d. standard Gaussians, but the find- 
ings are largely independent of the shape of the distribu- 
tion. If at any time W(t) became less than zero, we set 
it W(t) = 0. If we restrict ourselves to the mean-field 
solution, then at every node Q 



</i) 



(W) 



(fi) 



(26) 



This result implies that when Yiw > 0, there is a 
crossover from «t = 1/2 to ckt = 1 around the node 
strength (/) ~ {W) 2 jY? w . 

The process was simulated with the other parameters 
set as before. With the increase of T,w the best fit to Eq. 
([3]) yields intermediate effective exponents between 1/2 
and 1, see the inset of Fig. [TH However, these are not 
"true" exponents, only signatures of the crossover. 



4-1.5. Impact inhomogeneity 

The random walker model can be modified [33[ to en- 
tail Eq. (fTT|) . This means that when a walker steps 
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Random walk (TFS) 




Figure 17: Fluctuation scaling for the random walker model 
with inhomogeneous impact, parameters M = 20000, W = 
100, s max = 100, T = 10000 and various values of p. The 
case P — is the same curve as in the inset of Fig. 1151 
The q?t (T, w — > 0) limit is well described by Eq. (JTBJ). For 
£w ^> (W) every system displays a crossover to qt = 1. 



onto a site with typically more visitations, it generates a 
higher impact. Since the number of visits is proportional 
to the degree of the node ((Ni) oc fcj), in order to have 
the impact inhomogeneity relationship (Vi) oc (A r ,;)' 3 , one 
can simply introduce that for a token visiting a node of 
degree fcj, the impact should be (Vi) = . Simulation 
results perfectly conform with the theory, aT(Ew = 0) 
depends on j3 as expected from Eq. (fT8]) . The crossover 
persists to = 1 when one introduces a large variation 
in the number of tokens, see Fig. [T71 



4.2. Critical fluctuations and finite size scaling 

The mechanism how (spatial) correlations produce 
non-trivial values of a draws on some fundamental knowl- 
edge in statistical physics. Critical systems are known 
to exhibit anomalous fluctuations due to the presence 
of strong, but non-trivial correlations. These originate 
from the interactions of the internal constituents as for 
e.g. Ising spins. 

It is instructive to consider the simple ferromagnetic 
case, like the nearest-neighbor Ising model [24| on a d 
dimensional square lattice. Because this model does not 
a priori have dynamics, its analysis can be understood 
in the language of ensemble averages. 

The number of spins is N = L d , where L is the linear 
size of the lattice. At the critical point local magnetiza- 
tion has a diverging correlation length, and the correla- 



tion function becomes of the power law form 

C « « ^T~2+V ( 27 ) 

The squared fluctuations of total magnetization [er 2 (M£)] 
are known to be proportional to the susceptibility x, and 
for finite systems both quantities diverge at the critical 
point as 

a 2 (Ml) oc x oc L d+ ->/». (28) 

This is one of the well known results of finite size scaling 
(FSS) [I!]. 

The susceptibility can be calculated as the integral of 
the correlation function: 

NT C L d d r 

x= ^fJ drC(r)cxN J v^* Ld+2 ~ v - (29) 

It is well known that the exponents in E qs. ([28]) and 
(H§ are related, j/v = 2-ij (Fisher's law [Hj]). At the 
critical point, due to the interactions between the spins, 
the susceptibility becomes super-extensive, i.e. it grows 
faster than oc L d 1 a typical sign of criticality. 

Let us now consider an ensemble of finite Ising systems 
at the critical temperature with zero external field and 
with various linear sizes, and let the signal be the Nl i 
number of "up" spins. Of course the total number of up 
and down spins is constant: 

A^.T + N Lll = L d , 

and their difference gives the magnetization as 

M L = N LA -N LA . 

With the notation o(L p )/L p — * 0, at the critical point 

N^ = L d /2 + o(L d ). 

On the other hand, the fluctuations of M and iV-j- are 
proportional, because 

M L - 2N LA - L d , 

and so 

a(N LA f = ^Ml) 2 /2 oc L d + 2 -" + o(L d+2 - r >). 

Consequently, to a leading order, there exists EFS be- 
tween the fluctuations and the mean of the number of up 
spins: 

— 7TT \"2 -— 2cke 

a(N LA ) oc Nl,-\ 

with 

1 2-7i 1 y/v , , 

The above are true up to the upper critical dimension, 
which is d c — 4 for the Ising model (98|. The mean-field 
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results can be recovered by substituting the correspond- 
ing values: d = d c = 4, 7 M f = 1, %f = 1/2, Vmf = 0. 
Finally q;e,mf = 3/4, in agreement with the direct math- 
ematical proof of Ellis and Newman [g9j]. Moreover, at 
the critical point the susceptibility is superextensive, so \ 
must grow faster than L d . This means that in Eq. (j29|) 
d + 2 — r\ > d, and thus 77 < 2. On the other hand if 
77 is non-negative, then from the constraints < r\ < 2 
and d > 1 it immediately follows that 1/2 < q:e < 1. 
This range is also valid for the analogous behavior of all 
n-vector models. 

This result is two-fold, depending on how we look at 

it: 

(i) The exponent «e resembles the finite-size scaling 
exponent of fluctuations/susceptibility. Thus in this 
case fluctuation scaling essentially finite-size scal- 
ing. The difference is that the FS calculation can 
be done even when there is no data available about 
"system size". Instead, because JVf is a positive ex- 
tensive quantity, we know that its expectation value 
will be proportional to the system size, and thus it 
can act as a surrogate variable for L d . An anoma- 
lous value of the FS exponent can be related to criti- 
cal behavior, although - as previous sections suggest 
- not necessarily. We will discuss this question in 
detail in Section [4~3l 



(ii) The finding that when the constituents are long- 
range correlated gives rise to anomalous values of 
a, leaves us with a recipe how to construct simple 
models that display 1/2 < a < 1. The simplest 
scenario is described in detail in Sections 13.3.41 and 



What is the case with iVj off the critical point? In the 
paramagnetic phase the mean number of up spins is ex- 
actly N/2, while the fluctuations are of order N 1 / 2 , thus 
«e = 1/2. The ferromagnetic case is a more delicate 
issue, because the infinite system is not ergodic: spon- 
taneous magnetization is symmetry breaking. For finite 
systems with a local (e.g. Glauber) dynamics it takes a 
finite (but very long) time for magnetization to change 
direction. The phenomenon is more easily interpreted via 
an (unrestricted) ensemble of equilibrium ferromagnets. 
Here still / = N/2, because configurations magnetized 
up and down average out. The fluctuations on the other 
hand are macroscopic, aZ cx 2\M\ cx L d . Thus cke = 1- 
In sum, the paramagnet-ferromagnet phase transition is 
signaled by FS as an abrupt change between the two 
universality classes (similarly to the Satakc-Iwasa forest 
model 0, IHj]). At the critical point one finds interme- 
diate exponents that can be calculated from the usual 
critical exponents. However, it is of fundamental impor- 
tance that the anomalous FS is not observed in the order 
parameter M. Instead, it is observed in an extensive 
quantity, and only whose fluctuations reflect the anoma- 
lous fluctuations of the order parameter. FS is there in 
M , but with a trivial exponent: From finite size scaling 



M cx L d -' 3 / v '. This, combined with Eq. (JUD leads to 
a(M L ) oc M^+i/^/^-P/^V Due to the hyperscaling 
relation 7 + 2/3 = dv this means ole = 1- 

The critical point is a very special state of a system, 
while fluctuation scaling with 1/2 < a < 1 occurs very 
often. To make criticality a viable explanation for these 
non-trivial values of a it is important to notice that 
certain types of dynamics under strong external driving 
can self-organize to their criti cal state without the fine- 
tuning of any parameters @, EM [Toil. Many real life 
systems display the classical signs of self-organized criti- 
cality (such as power- law distributions, long-range corre- 
lations, etc.) and the value of a can help to understand 
the dynamical origins of these observations. 



4.3. Scaling and multiscaling 

Scaling has a fundamental importance in statistical 
physics. It has found countless succ essful applications 
starting with critical phenomena [I02| , but more recently 
also outsid e the classical domain of physics, for example 
in ecology [103| |. In many cases scaling is not bound to 



a specific set of system parameters like in the case of 
critical phenomena, but it is the generic behav ior o f the 
system as in polymers jl04| . surface growth [l05| | and 
self-organized criticality [2| . Mono-scaling or gap scaling 
means that the probability distribution of a quantity / 
depends on the parameter L, usually the system size, as 



L) = r i F 



J_ 



(31) 



where F is a scaling function and $ is some constant. 
This form can account for a number of observations about 
power law behavior in real systems. 

Both gap scaling and fluctuation scaling characterize a 
large number of complex systems. Nevertheless, for the 
same quantity only one can be true except in a special 
case: If a quantity shows both gap scaling and fluctua- 
tion scaling, then this automatically implies a = 1. One 
can reverse this argument: If for a quantity one finds 
fluctuation scaling with a < I then it cannot exhibit gap 
scaling. 

The proof is straightforward. Any moment of / can be 
calculated as 



fl = 



dff q 



fo 



»(f,L)~K q L<>*, 



(32) 



where "~" denotes asymptotic equality and K q > 0. 
From Eq. ^) it follows that 



°\=!1-Tl 



K 2 L 2 * - KfL^ = (K 2 - Kf)L 



-2 r 2* 



2\ t2<S> 



(33) 



We combine EFS and Eq. (|33|) , eliminate L and find that 
now er-£ cx jl , i.e., a = 1. 
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The only possibility for the coexistence of gap scaling 
(f3~Tj) and fluctuation scaling ([5]) with a < 1 is when the 
constant factor in the variance vanishes: 



(K2 - Kl) 



0. 



In this case the gap scaling form does not describe the 
variance, that is instead given by the next order (cor- 
rection) terms. Nevertheless, even if it is so, the leading 
order of the variance is still zero, and consequently F is 
proportional to a Dirac-delta: 



J_ 



This case is pathological, and it is usually not considered 
as scaling. In fact, the previous section contained one 
such example: The number of up spins in a critical Ising 
model follows this sort of statistics. Fluctuations scale 
anomalously [a\ cx L d+2 ~' n ), whereas their leading order 

vanishes because f 2 — f — L 2d /4. Such strange scaling 
arises as a sign of criticality when the scaling variable is 
an extensive quantity, for which only the fluctuations are 
connected to those of the order parameter. 

For example, in ecology there do exist species with 
awl, for which a gap scaling form of the probability 
density of / could be valid. However, this value is by 
no means universal (cf. Fig. [I}. Similarly, a < 1 was 
observed for Internet router traffic Q or the traded value 
on stock markets [Hsj . These quantities cannot have a gap 
scaling form. 

Instead of gap scaling, one can assume multiscaling, 
but the results do not change crucially. A probability 
distribution shows multiscaling if its size dependence is 
of the form 



hi 



L)/ HL/L ) = -FMf/fo)/ HL/L )} , (34) 



where /o and Lq are appropriately chosen constants. The 
moments can be calculated by expressing the density 
function from Eq. (j34|) and substituting into the defi- 
nition 



f q nf:L)df 



o V^o 



-F[ln(///o)/ln(L/L )] 



fS 



£ \ <? a (<?) / t \ -fMq)] 



K n L T{q \ 



The usual approach is that the value of the integral is 
dominated by the point /* (q) where the integrand is max- 
imal. Then 



a(q) 



Hf*(Q)/M 



and 



HL/L () ) ' 
r(q) = max.[qa — F(a)], 



or cquivalently = a. Now we are back at the same 
situation as with gap scaling, since 



and 



K 2 L T W - KfL 2 ^. 



One expects that r(2) > 2t(1), because the variance 
must remain non-negative for arbitrarily large L. If 
t(2) > 2r(l) then the first term dominates a 2 Ll and 
a = 2t(i) ' but ^ ms vame is greater than 1. For exam- 



ple Tebaldi et al. [1061 ] report that in the Bak-Tang- 
Wiesenfeld sandpile model of L linear size, the distribu- 
tion of the number of topplings / in an avalanche follows 
Jl oc 17^ with r(l) w 2, and r(2) « 4.7. This results 
in an a w 1.17. 

The other possibility is again r(2) = 2r(l), and a = 1 
(unless the leading order terms in a 2 compensate to zero). 
This solution offers nothing new compared to gap scaling. 
Such relationships can be seen, e.g., in the very same 
BTW mod el for the distribution of the area affected by 
avalanches [l06j |. 

The conclusion: If a quantity shows gap scaling with 
a scaling function which is not fully degenerate (not a 
Dirac-delta), it must follow a — 1. If there is multiscal- 
ing, then fluctuation scaling with a > 1 is also possible, 
but such values are rarely observed and should be taken 
with care. 



4.4. Binary forest model 

In this section we introduce a toy model that can be 
used to better illustrate the ideas of Sections 14.2114.31 
Moreover, we will show that those are in full analogy with 
the findings of Scction [2.3.2l for the reproductive activity 
of trees. For an easier understanding we will present the 
model in that language. 

Let us consider a forest that consists of N trees. For 
simplicity we also assume that these are situated on a 
one-dimensional regular lattice, but any higher dimen- 
sional generalization is straightforward. In the year t the 
reproductive activity (i.e. seed count) of every tree n 
is characterized by a random variable V n (t). Again, for 
simplicity we consider V's as binary variables, which are 
1 with probability p and with probability 1 — p. Be- 
cause it takes several years for a new tree to reach its 
full reproductive capabilities, given that the observation 
period is short enough, we can neglect the changes in N 
due to seed production and tree growth. 

The year-to-year correlations in seed counts are ne- 
glected. On the other hand, it is known [48| that the 
reproductive activity of forests exhibits long-range spa- 
tial dependence, with significant positive correlations for 
distances of thousands of kilometers. The distance de- 
pendence can be fitted approximately by 



C(An) (x (V n V„+An) - (V n f oc An 



2H V 
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see Section 12.3.21 The total seed count is given by the 
usual form 

N 

!n = Y. v - 

The standard deviation of the sum of random variables 
correlated according to Eq. (fT^j) scales as 

<?n = (f 2 N ) - (In) 2 k N H - 

with Hy being the Hurst exponent [cf. Eq. (fT0|) ]. 

whereas 

(f N ) = PN. 

The two equations can be combined into TFS with 

a T = Hy, J20]) 

To the careful reader it should be clear that almost 
the same model was discussed in Section 14.21 There we 
argued that in a critical Ising model whether any given 
spin points upwards (1) or downwards (0) is essentially 
a binary random variable with p = 1/2. Moreover the 
spin alignments are power-law correlated in space, such 
that the power of the decay is related to the FS exponent 
a. This was expressed by Eq. (|30|) . which is essentially 
equivalent to (f2"TJ)) . The binary forest model only differs 
from the Ising case in that correlations between the ran- 
dom variables are given a priori, and not generated by 
the thermodynamics. 

Now we can move on to simulation results. 12 Fig. [TSl 
shows the dependence of a n on N and (/), the two plots 
are basically equivalent due to (/) = pN. Fig. [PWlcft) 
illustrates that the fluctuations in systems of the same 
size increase rapidly with Hy. This is due to a strong 
synchronization of the individual constituents [see Fig. 
[fright)]. The relationship (|20]) is illustrated in Fig. [20] 

5. DISCUSSION 

In this section we present our view about unsettled 
questions related to fluctuation scaling. We also discuss 
recent, sometimes controversial techniques that might 
help the understanding of FS. 



Except for the trivial case Hy = 1/2 (when V's are not strongly 
correlated) the above model is not very straightforward to sim- 
ulate. We generated a one-dimensional fractional Brownia n mo - 
tion time series by applying the method of Koutsoyiannis Il07ll , 
and then converted it into a sequence of O's and l's 13 . The con- 
version slightly decreases the value of the Hurst exponent, which 
thus had to be measured independently by Dctrcnded Fluctua- 
tion Analysis [6lj | . For simplicity, we fixed the number of trees, 
because the effect of externally imposed noise {Y? N > 0) has 
already been studied in detail in Section 13.2.21 and, e.g., Refs. 
[lH for other models. 



5.1. Separation of global and local dynamics 

In Section 13.21 we argued that a system whose internal 
dynamics can be mapped onto the central limit theorem 
displays fluctuation scaling with a = 1/2. On the other 
hand, if one imposes a strong external driving to the sys- 
tem, the behavior crosses over to a = 1. One example 
was shown in Section 14.11 in the case of random walks on 
complex networks. There the fluctuation was given by 
Eq. (|26|) . which has the structure 

°i = (fi) + A* (fi) 2 i (35) 

where A is proportional to the strength of the external 
driving. If A <C 1 one finds a = 1/2, whereas in the 
strongly driven limit A > 1 the first term is negligible 
and a = 1. 

Now assume that we do not know the strength of ex- 
ternal driving and we want to approximate it from data. 
We can introduce the global activity F(t) of the system 
as a sum over all constituents: 

N 

F(t)=J2M)- (36) 

i=l 

Ref. [3l| suggests that if our system has many elements, 
then F(t) will be proportional to the external force, be- 
cause the independent fluctuations of the elements aver- 
age out in the sum (|36p . and what remains is only the 
factor of the common external driving. This argument 
implicitly assumes, that the external force contributes to 
the fluctuations of the elements in a coherent way, i.e., 
fi(t) can be written in the form 

f i (t)=ft\t)+fr t (t), (37) 

where 

fr\t) = AiF{t). (38) 

This formula is a form of linear response, where: (i) Ai 
is not allowed to depend on time because of stationarity. 
(ii) More importantly, all nodes are affected by driving 
instantaneously or with the same constant time lag. 

After the summation of Eq. (|36|) we find that it is 
consistent with Eq. (f3"7| . if the normalization condition 
Ai = 1 is satisfied. In order to keep Eqs. (f3"S"|) and (|3~7| 
consistent in the strongly driven limit, the only possible 
choice is 

A t = j|j. (39) 

By this definition automatically (/- nt ) = and (/° xt ) = 
(fi). All time series have finite standard deviations, 
which are defined in the usual way, for example of = 
y/(F 2 ) - (F 2 ). With these 

= ?& (fi) , (40) 
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Figure 18: (left) Scaling plots of a versus N generated by Detrended Fluctuation Analysis of V„(t) in the binary forest model. 
The slopes correspond to the (spatial) Hurst exponents Hv ~ 0.5, . . . ,0.95 from bottom to top, see Eq. (|10|l . (right) Scaling 
plots a versus (/) for FS in the same data. The slopes correspond to the values of «t ~ 0.5, . . . , 0.95. 




Figure 19: (left) Examples of f%(t) time series for a "forest" with N = 300 "trees". The Hurst exponent Hv between the trees 
was varied: Hv ~ 0.5,0.65,0.8,0.95 increasing from bottom to top. The data were shifted by the addition of a constant, 
but they were not stretched in any way. One can see that due to the increasing synchronization of the constituents, relative 
fluctuations increase rapidly, (right) Snapshot of Vi, n series (at a fixed time i) for a forest with N = 300 elements. The data 
were shifted by the addition of a constant. The Hurst exponent Hv between the elements was varied: Hv ~ 0.5, 0.65, 0.8, 0.95 
increasing from bottom to top. Spatial synchronization increases with the growth of the Hurst exponent. 



so the external component follows FS with a = 1 in any 
system. This appears consistent with the fact that in 
strongly driven systems / itself also shows a = 1, not 
only the external component. However, this is in fact 
just a trivial consequence of how the external component 
was defined. 



Rcf. 



3l| | calls the process of assigning the internal and 
external components "noise separation", and claims that 
the procedure works well for the random walk model. For 



each node they define a noise ratio 



which is zero in the absence of external driving, and large 
when the fluctuations of the external component are dom- 
inant. 

It would be tempting to attribute the real world ob- 
servations of a ~ 1 to external driving, and show that 
in these cases typically 77 3> 1. However, we will demon- 
strate on some examples that noise separation has strong 
limitations. 



2G 



Binary forest model (TFS) 




Figure 20: The equality oe = Hv (dotted line) in simulations 
of the binary forest model. The measurement points align 
very closely to the line, with some statistical deviations. 



Ref. [3l[ finds that the fluctuations of Internet ( Abiline 
backbone) traffic show TFS with a = 1/2 and typically 
r\ ~ 0.1. While this appears very convincing, a more 
detailed analysis of a subset of the same data [32| instead 
finds a = 0.7 — 0.8. The latter study suggests congestion 
as the origin of the increase value of a, and does not 
assume any external driving force. 

Ref. 0] reports, that river level fluctuations fall into 
the class a = 1. It seems plausible that water levels 
fluctuate due to rainfall on the river basin, which can be 
understood as external driving. However, noise separa- 
tion is impossible here, because the driving is not coher- 
ent. The global factor F(t) = J2i /»(*) i s meaningless, 
because the response times of the water level, and the 
timing of precipitation vary from river to river. Hence 
Eq. K35J) is not valid. 

To our knowledge, our study [56[ was the first to re- 
veal fluctuation scaling in the trading activity of stocks. 
Noise separation was carried out there, finding that the 
typical value of r)i increases with the time window At. 
Because a also shows a similar tendency (cf. Fig. [5]), 
we suggested that external driving must play a role in 
this effect. We also argued that this is because informa- 
tion needs a finite time to spread on the market. On the 
scale of a few minutes the role of external information 
is small and localized, whereas on the long run trading 
is dominated by the external macroeconomic trends and 
news. 

Later we proposed a much simpler explanation Q, 
which was also summarized in Sections 12.3.41 and 13.3.11 
of this review. In the stock market (and human dynam- 
ics, see Section 12.4. 2p one observes, that for long times 
a{At) = a* + 7 log At and H, = H * + 7 log (/;) with 
some 7 > 0. These laws have not yet been related to 



any external force, even though the possibility cannot be 
ruled out. 

How would noise separation work in this case? 

1. Clearly op cx At Hp , with Hp sa maxjiij, because 
F is the sum of all fi's, and the scaling of the sum 
is dominated by the highest Hurst exponent. 

2. erf** cx up cx At Hp . On the other hand, if 
cr° xt < trj nt , then one expects that qualitatively 



- int cx At H - 



3. Thus the ratio rji = crf xt /<rj nt should typically grow 
as long as r\i < 1. This observation of Ref. [56[ is 
hence no proof of any particular external influence. 

While to present further calculations is not the purpose 
of this review, we believe, that rj ~ 1 can arise from 
spurious effects. A value r\i > 1 consistently, for many 
nodes has only been observed in a single study where 
rj ~ 1.5 [601 ]. Our present understanding is that noise 
separation has a limited range of applicability. 

Finally, we would like to point out that to identify 
the ensemble average ([56")) with some external force is 
somewhat controversial. Since we do not have any infor- 
mation about the origin or the physical meaning of the 
factor F{t), it is probably more appropriate to call this 
and /j ext 's global and not external factors. Accordingly, 
/j lnt 's are better called local, rather than internal factors 
when it is unknown how much they represent internal 
processes at the nodes. 



5.2. Limit theorems for sums of random variables 



In Section [3.2.1l we briefly remarked on the connection 
of a = 1/2 to the central limit theorem. We recall that 
/ is written as a sum over the constituents (other ran- 
dom variables) whose number N we will consider as time 
independent: 



N 



Let us assume that a general form of central limit theo- 
rem is applicable, so that 



N a Z v 



(41) 



where X is some random variable and "— >" means con- 
vergence in distribution for N —> 00. In the language of 
FS the same equation reads 



£n=lK-(/) 

K{f) a 



x, 



(42) 



where K is & constant. The conceptual difference is 
only that since we know that (/) = N (V), we can use 
(/) as a surrogate variable for N. 
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This analogy tells us that the appearance of FS 
throughout disciplines might be due to the generality of 
certain limit theorems. The trivial example is of course 
that of i.i.d. variables with positive mean and finite vari- 
ance, leading to the value a = 1/2, but there are several 
other cases. 

If the V's are i.i.d., but their distribution decays 
asymptotically as P(/) oc /~ (A+1) with < A < 2, then 
the Levy-Gnedenko central limit theorem 14 is applicable 
(87j |. That is in spirit similar to Eq. (|Hj). with a = 1/A. 
The difference is that a, and if A < 1 even (/) is infinite. 
However, for N < oo they will have some finite effective 
value, which can show apparent fluctuation scaling with 
some non-trivial value of a. 

Contrary to the relative simplicity of independent 
(and possibly identically distributed) random variables, 
dependent variables can be extremely diverse. They 
have no general theory, and the number of universality 
classes/limit theorems is infinite. Their structure is not 
always fully described by pairwise correlations and Hurst 
exponents (cf . Sections 14. 2114. 4p . In these cases some- 
times there exists no limit distribution, or, e.g., a < 1/2 
or a > 1 in Eq. flUJ) (To^ . 

Even for the usual 1/2 < a < 1 values there is only 
a limited set of results, here we only mention a few in- 
spired by statistical mec hanics. In a series of papers Ellis, 
Newman and Rosen j99l . IlloL llll| show that in some sta- 
tistical mechanical systems physical quantities can obey 
Eq. (|4ip with a = 1 — l/2fc, where A; is a non-negative in- 
teger. For example, in the Curie- Weiss mean-field model 
the number of up spins obeys k = 2 and a = 3/4 at 
criticality, and the distribution of X can also be given 
explicitly. In Section 14.21 we arrived at the same expo- 
nent using heuristic arguments. Baldovin and Stella [112j 
recently published some more general results on a mean- 
field theory of strongly correlated random variables. In 
their model fine-tuning the strength of correlations allows 
for any 1/2 < a < 1, much in the spirit of Section |4~41 
and Ref. 

5.3. The connection of ensemble and temporal 
averages 

Let us now return to the connection between the TFS 
* (At) oc {h) aT , © 

and the EFS 

WW(At) oc 7W QE © 

laws. These correspond to two definitions of the statisti- 
cal quantities: (i) for Eq. © the mean and the standard 



In fact the conditions of the Levy-Gnedenko central limit theo- 
rem are somewhat looser. 



deviation are calculated as temporal averages; (ii) for Eq. 
© they are calculated on an ensemble of subsystems of 
the same size. 

For the mere existence of such quantities it is necessary 
to assume that: (i) signals with the same mean have the 
same statistical properties, and the processes are station- 
ary; or (ii) systems of the same size can be considered ele- 
ments of the same statistical ensemble, which is a kind of 
a homogeneity condition. In real systems neither of these 
conditions holds exactly, but they often prove to be good 
approximations. A deviation from these assumptions is 
one possible source of the observed broadening of the 
scaling plots. For example, in the case of precipitation 
data in Section 12.4.31 we found that mean precipitation 
is not the only determinant of the amplitude of fluctua- 
tions. Areas with the same mean precipitation are not 
equivalent, because they can correspond to very differ- 
ent climates. Factors such as height and geographical 
position are also relevant. 

A related concern is the presence of correlations 1 1 1 3j ] . 
The observations may be correlated in space or time, (i) 
Two nodes (e.g., populations or weather stations) which 
are located close to each other can have significant cross- 
correlations. Fits can be biased, because the observa- 
tions are not independent, (ii) The signals of individual 
nodes can have strong temporal autocorrelations, which 
can amplify statistical errors when the time series are not 
long enough [82j . 

A more delicate question is the connection between the 
two types of FS, which has been very vaguely investigated 
in real systems so far. First of all, the two factors cannot 
be separated completely. McArdle et al. [8l| point out 
this problem through the example of animal populations, 
(i) The measurement of the number of individuals in an 
area takes a finite time. There is an in and outflow of 
individuals, so the number fluctuates. Thus temporal 
dynamics can affect the results, (ii) If we want to measure 
the time series of the size of a given population, we have 
to assign a spatial scale as what to consider a population. 
The temporal dynamics will depend on this spatial scale 
of sampling, possibly in a non-trivial way. 

Taylor and Woiwod [II] conducted a very large scale 
study of the two (temporal and ensemble) FS laws in 
animal populations. A systematic comparison is possible 
when the same sites sampled are at the same time f8lj . 
Taylor and Woiwod calculated the temporal and spatial 
means and standard deviations of the abundance of some 
aphids, moths and bird, then calculated ax and «e for 
each species. 

First of all, they found that the temporal and ensem- 
ble means of population differ significantly. Thus it is not 
surprising that the values of a differ as well. There was 
absolutely no systematic relationship between q?t and 
oe, and even the same species can show several such val- 
ues depending on its natural environment. Rather inter- 
estingly, the only systematic dependence between species 
is the presence of positive correlations between the value 
of a's, and average population size. For example, for 
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temporal data this means that across species «t is corre- 
lated with 1/M J2iLi (/*)• The correlations are present 
in both cases, although st rong er for the temporal vari- 
ant. Taylor and Woiwod [88| also suggested, that the 
interactions between individuals might contribute to the 
ensemble law more than to the temporal one. 

In some studies such as ecology or climatology the def- 
inition of the spatial scale comes naturally. Still, most 
systems have some hierarchical structure on which a de- 
gree of aggregation is possible. For example, it is possible 
to analyze the fluctuations of Internet traffic at the au- 
tonomous system level instead of the router level, which 
might have a different dynamics. On the stock market 
TFS holds not only for individual stocks, but also when 
we consider the trading activity of industry sectors [49j |. 

In sum, the relationship between ensemble and tempo- 
ral fluctuation scaling is rather unclear in real systems. 
The exponents «e and ax are seldom calculated for the 
same system, and when they are calculated, they have 
different values. 



5.4. Fluctuation scaling for growth rates 

To be able to interpret FS for temporal fluctuations 
one has to assume that the underlying system is station- 
ary. For example, in the binary forest model of Section 
14.41 we assumed that the number of trees is constant and 
we neglect the contribution of reproduction to the pop- 
ulation. To depart from stationarity, we can consider a 
growing population of Ni(t) individuals, all of which can 
reproduce at a time t [Vi, n (t) = the number of offsprings], 
die [Vi :n (t) = — 1] or do nothing [V^ n (t) = 0]. The popu- 
lation can now be written as a sum 

Nt(t) 

N t (t + 1) = Ni(t) + J2 V i,n(t)- 
n=l 

Ni (t) is obviously not stationary, because its distribution 
depends on its value in the previous time step. Never- 
theless, one can still construct something similar to TFS 
by using a restricted ensemble average as follows. 
Let us define the growth rate of a population as 

f i (t)=N i {t+l)-N i (t). 

Now let us make an ensemble of growth observations 
when the initial population was N. The growth rate in 
this restricted sample is given by 

N 

fN = Y, V «- 

n=l 

Because / can be negative, to postulate TFS for size 
dependence it is more convenient to use N and not (/) 
as the scaling variable. We conjecture 

°n = (f 2 N ) - (f N ) 2 * N a (43) 



in the spirit of the previous sections, and this is exactly 
what is found in many systems. 

Keitt and Stanley show [67], [68| that the growth rate 
fluctuations of animal populations scale as a non-trivial 
power of the initial population N. The finding is not 
specific for population growth, but occurs in many set- 
tings where a positive quantity fluctuates by the addition 
and subtraction of increments. The same behavior was 
found by Lee et al. [62j for the growth rates of business 
firms, and Amaral et al. [g3| even presents a model of the 
complex structure of the business growth process which 
predicts the correct exponent. 

Janosi and Gallas [6J| criticize these results, and show 
the same distribution of growth rates and the scaling law 
(|4"3"|) for the water level fluctuations of the river Danube, 
which trivially must have a structure that is very different 
from business firms. Moreover, they show that the daily 
absolute change of water level scales with the average wa- 
ter level on the same day, and there is a clear crossover 
behavior between two scaling regimes with a = 1/2 and 
a = 1. A related study by Dahlstedt and Jensen [6f| 
estimates a w 0.9 — 1, and suggests that FS can be de- 
composed into two distinct scaling laws: <ja oc A a and 
(Ja) oc A b , where A is the area of the river basin. 

From the above it is clear that the size-dependent scal- 
ing of growth rate fluctuations is a variant of fluctuation 
scaling for nonstationary (growing) populations. The 
same formalism can be applied in both cases, and many 
results could be mutually applied. 



6. CONCLUSIONS 

The aim of this review was to provide a broader per- 
spective on Taylor's law and fluctuation scaling, and to 
encourage the collaboration between disciplines where 
these phenomcnona are observed. We also outlined a 
classification scheme on the meaning of the FS exponents. 
The main conclusion is that several types of mechanisms 
can lead to the same value of a. A similar concern was 
formulated in the 1982 paper of Taylor and Woiwod [88| : 

"Extrapolation of dynamic principles from [...] obser- 
vation is likely to be misleading. We find great differ- 
ences between [species], but the overlap is also very large. 
Whilst it is improbable that the details of [...] behaviour 
in a bird and an aphid would be alike, there are common 
elements in the [...] structure of their populations." 

While fluctuation scaling alone is not enough to iden- 
tify the underlying dynamics of a system, it is useful 
for excluding some possibilities, and for rejecting certain 
models which would generate unrealistic a's. Empirical 
data from virtually all fields of science display fluctuation 
scaling, and so it is possible to make statements about 
almost any system where such data is available. For this 
very reason, in order to deepen our understanding of the 
phenomenon, it is becoming increasingly important to 
bridge the gap between several disciplines. But the most 
puzzling question still remains: Why do, for example, 
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email traffic, stock market trading and the printing activ- 
ity in a computer lab behave in similar, non-trivial ways? 
Some insights can be gained from the time window depen- 
dence of a. That can reveal whether on some time scale 
the behavior of the system reduces to something simpler 
or possibly trivial. One can also make efforts to decom- 
pose the signals into well-defined constituents, so that a 
mean-field model based on sums of random variables can 
be applied. We believe that a possible common origin 
of all fluctuation scaling laws is the generality of these 
underlying mean-field type mathematical structures. 
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Appendix A: THE COMPONENTS OF THE 
FLUCTUATION a 2 

A large part of this review is concerned with the stan- 
dard deviation of the sums of random variables. This is 
defined as 



a 2 =J2 P ( N ) [ dV N P(V N )V] 
[J>(iV) / dV N P{V N )V N ) 



N 



N 



dV N P(V N )V 2 - / dV N ¥(V N )V N 



(Vn) 2 



£2 =JVK2 



^P(iV) ( fdV N F(V N )V N 



(V N ) 2 =N 2 (V) 2 
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N 



^P(iV) / dV N P(V N )V N 



4^P(iv)Jv- 
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(Vn)=N(V) J 



(N) 



(vy 



E P{N)N 2 - I E P(N)N 



N 



N 



(N 2 



(N) 2 



Thus finally 



= E 2 , (N) + (V) 2 £ 



In the case when the V n 's are strongly (i.e., power law) 
correlated E 2 ^ = Y^N 2 ^ where h 
nent as defined in Eq. (fTT)|) , and so 



T,yN 2Hv where Hy is the Hurst expo- 



a 2 = Y 2 v (N 2H -) + {V) 2 T 2 N . 

The correlations in TV are not reflected directly in this 
expression. Instead, they affect how E^v changes with 
the time window size At as pointed out in Section [3.3.1l 




where V n are the individual (not necessarily independent) 
random variables, and N is the number of these variables 
which itself can be random. 

Let P(iV) be the probability that the number of vari- 
ables is N. The sum of N variables can be written as 
Vn = J2n=i -^et P(Vjv) denote the density function 
of this sum when N is fixed. Then the standard deviation 
of the sum when N itself is a random variable is 



Appendix B: TWEEDIE MODELS AND IMPACT 
INHOMOGENEITY 



In this appendix we prove that origin of the non-trivial 
a values in the formalism of Kendal [H, IfiS! is essen- 
tially due to impact inhomogeneity. Kendal's formalism 
is based on the so-called Twccdie exponential dispersion 
models 92] . These form a family of random distributions, 
characterized by the logarithmic cumulant function (see 
Ref. [13, p. 1516) 



K}( S ) = ln{e s f) f = \olg e ( S )-l} 



(Bl) 
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where s is a constant and / is the random variable. We 
use natural logarithms (In) , as opposed to other parts of 
this review, where we used 10- base logarithms (log). We 



also introduced the notation 
terms above are 



A = 



a- 1 



and 



9e(s) 



k6 



1 + 4 



f™dye xy . The two 



(B2) 



(B3) 



> and a < 

As pointed out both by Kendal [HI and Bar-Lev and 
Enis [92|, the form (|B1[) is characteristic of compound 
Poisson processes [83] ■ These are distributions of random 
variables of the following type: 



N 



/ = E^> 



where N is Poisson distributed, and V n are i.i.d. random 
variables. The proof is straightforward, however, we in- 
clude it here for completeness. The density function of a 
compound Poisson variable / can be written as a com- 
plete probability 



P(f) = E P(f\N)P(N). 



N=0 



The characteristic function is given by 
(e sf ) f = / dfe*fp(f) = 

OO „ 

£ P(N) / dfe g fp{f\N) 

oc „ 

]T P(N) / dfe s fp{V! + V 2 + .. 

N=0 



V N ) 



OO 

E 

N=0 



N 



P(N)(e* v ) v 



For the last equality we used the property of the charac- 
teristic function that (e s ^<" Vn } = (e sV ) . Then, know- 
ing that if N is Poisson distributed with mean (N) then 
its characteristic function is (e tN ) = e^ N ^ e —1 \ 



N 



N=0 



N 



; {JV)((exp( s V)) v -l) = ^efj 



(B4) 



The next step is to compare Eqs. (|B2|). (|B3j) and (TBI)) . 
One finds that for the Tweedie model 

(e° v ) v = 9e (s) = (l+ S -y. 

This is the characteristic function of a gamma distribu- 
tion. Its moments can be determined as usual: 



(V) 



(v 2 ) 



— (e sV ) 
ds K )v 



r) 2 

— (e sV ) 

ds 2 K /v 



a(a- 1)0" 



.5=0 



V% = (V 2 ) - (V) 2 = -a6~ 2 . 
For the expectation value of the Poisson variable: 
a - 1 / k9 



(N) = A = 



ka V a — 1 



Furthermore, 



(/> = 



and 



= Xa6-\ 



Xa(a - 1)0" 



Let us recall, that also A contains terms with 0. Simple 
calculation yields 

a 2 = k(f) 2a , 



where 2a = (a — 2) /(a — 1). Consequently: 

(i) Let us introduce (3 = — 1/a. Then 

(V)^(Nf, 
which is impact inhomogeneity. 

(ii) Moreover, a oc (/) Q with 

i (_ a 



= 2 l 1 ' 8+1 



exactly the same relationship as in Section f3. 3. 2 1 



Appendix C: FLUCTUATIONS IN THE 
NETWORK RANDOM WALKER MODEL 

This section contains calculations starting from the 
master equation (|2ip . The total number of visitations 
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to node i is the sum over all steps. The substitution of 
Eq. ([2T]) gives 

f i= J2Ni(s)= E E E 8 n (p^i;s), (CI) 

s=l s=0 jeKi n=l 

where S n , s (j — > i;s) is a variable which is 1 if the n'th 
token of node j in step s jumps to node i (happens with 
probability l/kj), and otherwise, fcj is the degree of 
node i, JCi is the set of neighbors of node i, and Nj(t = 0) 
corresponds to the initial condition. 

For any finite network one can switch the order of the 
first two sums, and in 

Smax S m „ x -1 Nj(t-l) 

/., = E^(s) = E E E -*";*) 

s=l jGKi s=0 n=l 

if 5* is taken large and because for any fixed n the vari- 
ables S ntS (j — > i) are independent, then due to the central 
limit theorem the last two sums converge to independent 
Gaussians: 



/, = E^w = E 

je/Q 



s=l 



(C2) 



where ^ (s) are i.i.d. standard Gaussians such that 

{Qi(s)gj(r)) = S^, (C3) 

where the right hand side has two Kronecker-deltas. Con- 
sequently 

(/<(*)/,•(*)> = (/*(*)) . when i ? j. (C4) 

One can take the expectation value of the left hand 
side of Eq. (C2]). Finally, 



ifi) = E 



(N 3 ) 



By substitution one can check that the solution is 



(fi 



(Ni) = ki 



(C5) 



(C6) 



and all the walkers are accounted for: Yli (fi) = SmaxW- 
Now let us calculate the standard deviation for both 
sides of Eq. |IC2J>: 



'■Vim X (Nj) \ I ^ ls max (Ni) 
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One can use Eq. (|C4p to write 



'Mil ax 



4ax (A^> <JVl> 



(c) = 0, because of Eq. (Qi) = 0. 



ue/c, 



because of (|C3[) . By changing a summation variable, one 
can write 



(«)= E 



«Lx W) W) 
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Combining all the above, one gets 
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(/) and (g) and (ft.) combine to 



E 



'max \ iy j 



E 



'max x 1 y j 



(N 3 f 



E 



fl. ■ / Aj - A/T 



E 



'max V v 3 



( Nj 



5 max \ iy j 



(a) can be replaced by 



, because (Qj) = 0. 

2 



E 



Then, 

^2 _ Zl _|_ Smax 

jG/c, i je/Ci 3 
The second term can be evaluated from Eq. (|C5j) , to find 



(C8) 



(6) 
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